XCM export_message() report errors back to statemine (to release blocked funds) #2443

EmmanuellNorbertTulbure · 2023-03-07T16:18:37Z

To recover from bridge errors.

serban300 · 2023-03-23T08:11:05Z

Taking a look.

bkontur · 2023-03-23T09:27:12Z

maybe this looks interesting XcmQueryHandler paritytech/polkadot#6900

serban300 · 2023-03-24T14:31:15Z

Looks like there are 2 error management mechanisms in XCM:

Error reporting
Asset trap

More details here: https://polkadot.network/blog/xcm-part-three-execution-and-error-management

Trying to understand more in-depth how they work, how we can use them and at which chains.

serban300 · 2023-03-27T10:08:29Z

From what I understand, the Asset Trap works like this: if an XCM message fails while there are some assets in the holding register, those assets will be saved and will be kept on chain.

After that they can be reclaimed using the ClaimAsset instruction. The ClaimAsset instruction removes them from the on-chain storage and puts them back in the holding register enabling the caller to perform other XCM operations on them (for example to move them into his account).

Trying to simulate this scenario over a bridge and see exactly how it works.

serban300 · 2023-03-27T10:45:22Z

maybe this looks interesting XcmQueryHandler paritytech/polkadot#6900

Thanks ! This seems related to Error reporting, which we might need as well. Will investigate it also. We might need a combination of Error reporting and Asset trap.

serban300 · 2023-03-30T08:44:32Z

Managed to test this locally on rockmine2.

Executed on rockmine2 using the Alice dev account:

RuntimeCall::PolkadotXcm(pallet_xcm::Call::<Runtime>::execute {
	message: Box::new(xcm::VersionedXcm::V3(Xcm(vec![
		Instruction::<RuntimeCall>::WithdrawAsset(
			vec![MultiAsset {
				id: AssetId::Concrete(MultiLocation {
					parents: 1,
					interior: Junctions::Here,
				}),
				fun: Fungibility::Fungible(1000000000000),
			}]
			.into(),
		),
	]))),
	max_weight: Weight::from_parts(10000000, 0),
});

This withdraws 1 KSM from Alice's account in rockmine2. Since we don't do anything with this amount, it is present in the holding register after executing the xcm instruction and the asset trap mechanism will save it on-chain in the rockmine2 storage

Executed on rockmine2 using the Alice dev account:

RuntimeCall::PolkadotXcm(pallet_xcm::Call::<Runtime>::execute {
	message: Box::new(xcm::VersionedXcm::V3(Xcm(vec![
		Instruction::<RuntimeCall>::ClaimAsset {
			assets: vec![MultiAsset {
				id: AssetId::Concrete(MultiLocation {
					parents: 1,
					interior: Junctions::Here,
				}),
				fun: Fungibility::Fungible(1000000000000),
			}]
			.into(),
			ticket: MultiLocation { parents: 0, interior: Junctions::Here },
		},
		Instruction::<RuntimeCall>::DepositAsset {
			assets: MultiAssetFilter::Wild(WildMultiAsset::All),
			beneficiary: MultiLocation {
				parents: 0,
				interior: Junctions::X1(Junction::AccountId32 {
					network: None,
					id: hex_literal::hex!(
						"d43593c715fdd31c61141abd04a99fd6822c8558854ccde39a5684e7a56da27d"
					)
					.into(),
				}),
			},
		},
	]))),
	max_weight: Weight::from_parts(10000000, 0),
});

This claims the 1 KSM that was trapped and transfers it back to Alice's account on rockmine2.

However there are 2 things to note:

The trapped assets are stored in a hashed form on-chain. So we need to claim the entire asset. If 1000000000000 units were trapped we can't make a partial claim for 1000 for example, because it won't be recognized. So we need to know the exact amount that was trapped and this can be affected by fees that are payed on the way. We need to check if and how we can find out the exact trapped amount in case of a failed xcm transaction over the bridge.
By mistake I happened to execute a ClaimAsset instruction followed by a DepositAsset to a wrong account. I got an xcm_error: FailedToTransactAsset("AccountIdConversionFailed"), but I think I lost the asset. At least it didn't seem to be trapped again and I couldn't find a way to claim it. I will check this further.

Other things still left to be done:

also try this scenario over the bridge
try to understand what happens when this happens over the bridge. Suppose we have 1 KSM that was sent from Rockmine2 over the bridge and was trapped at WococoBridgeHub or Wockmint. What should we do with it ? Should we have a KSM account at WBH and Wockmint for asset trap claims ? Or should the user have one such account ? Or should we try to send the asset back to Rockmine2 somehow ?

serban300 · 2023-03-30T10:07:58Z

There is a logged issue related to losing assets when DepositAsset fails: paritytech/polkadot-sdk#912

The problem is that the assets are removed from the holding register and in case of an error they are not put back. And it doesn't happen only for DepositAsset.

A solution would be to use a custom AssetTransactor that accounts for these lost assets.

serban300 · 2023-04-03T06:52:45Z

Another thing to pay attention to is this condition here.

There is a "barrier" condition that is checked before processing the XCM message. If this check fails, the XCVM won't be even started, post_process() will not be called and the assets will be lost (they won't be trapped).

We need to pay attention how to craft the messages in order to avoid triggering this barrier.

serban300 · 2023-04-03T15:30:07Z

maybe this looks interesting XcmQueryHandler paritytech/polkadot#6900

Thanks ! This seems related to Error reporting, which we might need as well. Will investigate it also. We might need a combination of Error reporting and Asset trap.

Started to look into Error reporting. From what I understand both ReportError and QueryResponse can only report the status. The assets will still remain trapped on the chain where the execution error occured. And we wouldn't be able to do a combination of ClaimAssets + ReportError/QueryResponse in order to report back to the source chain and unlock the assets automatically, because the Asset Trap is executed after the error handler or the appendix. And probably we wouldn't know how many assets to claim anyway.

Another interesting thing is that the AssetTrap is configurable. Thinking if we could use that.

serban300 · 2023-04-04T13:05:58Z

Trying to summarize the entire flow, to have a better overall picture. This is how it looks like at the present time. It might change. Some items still need research. I will update this comment as the research progresses.

Let's take for example an asset transfer from Rockmine -> BridgeHub Rococo -> BridgeHubWococo -> Wockmint

On Rockmine:

The user calls an extrinsic. This extrinsic reserves the assets, prepares the XCM message and sends the XCM message to BridgeHub Rococo. We don't execute any XCM instruction here. The message will be executed as it reaches the next parachains. The message contains a ReserveAssetDeposited instruction that will be executed on Wockmint, which will put the assets in the holding register and a DepositAsset instruction that will move them in the user's account on Wockmint.

Possible failures that would involve the assets: BridgeXcmSender::validate() and BridgeXcmSender::deliver() are called after reserving the assets. Maybe some other small methods. Anyway, nothing related to the actual XCM message. The message will be executed as it reaches the next parachains.

Mitigations: I think we can call BridgeXcmSender::validate() before reserving the assets, and maybe some other calls. But BridgeXcmSender::deliver() needs to be executed after, no matter what. Anyway, we should handle the errors in the code and unlock the assets.

On BridgeHubRococo:

Here the XCM message is received and we execute the ExportMessage instruction, which should send the message over the bridge to BridgeHubWococo, encoded as a blob.

Possible failures that would involve the assets: Here, the ReserveAssetDeposited is not called yet, so any failure would lead to losing the assets. They won't even be caught by the Asset Trap, because they are not in the holding register. Failures could involve the execution of ExportMessage, or any bridge failures.

Mitigations: TODO, STILL RESEARCHING. Here we might be able to send a QueryResponse back to Rockmine if the XCM fails.

On BridgeHubWococo:

Here the XCM message is received as an encoded blob, the bridge logic decodes it and sends it to Wockmint. No XCM instruction is executed.

Possible failures that would involve the assets: Here, no XCM instruction is executed, so any failure would lead to losing the assets. They won't even be caught by the Asset Trap, because they are not in the holding register. Mainly there can be problems decoding the blob, or sending the message to Wockmint, but I think the chances to get an error here are slim.

Mitigations: TODO, STILL RESEARCHING

On Wockmint:

Here we finally execute the last part of the XCM message: ReserveAssetDeposited + DepositAsset.

Possible failures that would involve the assets: Failures can be related to XCM message problems or the configuration of the pallet_xcm here. Maybe fees withdrawal, bad origin, etc. If any error occurs after ReserveAssetDeposited, the assets will be caught by the Asset Trap and an Event will be stored related to that. The event will contain the exact amount of assets that were traped.

Mitigations: Rockmint's account (or whatever the owner of the message was) will be able to execute a ClaimAsset instruction and reclaim the assets. Also a QueryResponse instruction could be sent back to Rockmine. The question is if we could automate the asset recovery here.
TODO, STILL RESEARCHING

Apart from all these we have to be mindful of the XCM barrier on each runtime where XCM instructions are executed: #2443

serban300 · 2023-04-12T07:03:58Z

As per the guidance from the XCM team:

XCM is designed to not fail at low level (e.g queues are essentially unlimited, overweight messages stick around). And the bridge should be designed in the same way
Failures should only come from intentional ignorance. (E.g. not having assets to pay the fees, depositing assets which the chain cannot represent, attempting to withdraw assets which are not in the Hold register or attempting to send or export a message to a location which is unsupported by the chain, etc )
The initiator of the action should perform all the checks before sending and ensure they are not ignorant
We should offer some APIs/RPCs in order to offer the possibility to check offchain, before sending, that the message will go through. Need to discuss exactly what. But probably some of the things mentioned above + checking the state of the message queues

serban300 · 2023-05-10T15:06:34Z

Considering that the bridge message queue satisfies all the XCM design assumptions (tracked in paritytech/polkadot#2038), I can think of a single issue here: halting the bridge. If the bridge is halted, the in-flight messages might get lost. Thinking of solutions.

* remove reference to remote-externalities * update reference to remote-externalities * s/remote-ext/frame-remote-externalities * copy the fix from paritytech/cumulus@747400a * update cargo.lock

EmmanuellNorbertTulbure · 2023-09-01T09:54:26Z

@serban300 document the conclusion

serban300 · 2023-09-01T11:24:48Z

The conclusion is the one above: #2443 (comment)

Closing the issue

EmmanuellNorbertTulbure assigned serban300 Mar 7, 2023

acatangiu changed the title ~~Export message report errors back to statemine (to release blocked funds)~~ XCM export_message() report errors back to statemine (to release blocked funds) Mar 8, 2023

This was referenced Apr 4, 2023

Report message dispatch status to source chain #2022

Closed

Report msg bridge dispatch error to XCM origin #2029

Closed

serban300 mentioned this issue Aug 25, 2023

Check that the bridge satisfies XCM design assumptions #2433

Closed

acatangiu mentioned this issue Aug 25, 2023

[P-K Bridge] BHP<>BHK XCM message passing (dynamic fees) #2430

Closed

5 tasks

svyatonik referenced this issue Jul 17, 2023

Companion of Substrate PR 12837 (#1944)

c9d3cbc

* remove reference to remote-externalities * update reference to remote-externalities * s/remote-ext/frame-remote-externalities * copy the fix from paritytech/cumulus@747400a * update cargo.lock

EmmanuellNorbertTulbure transferred this issue from paritytech/parity-bridges-common Aug 25, 2023

the-right-joyce transferred this issue from paritytech/polkadot-sdk Aug 25, 2023

serban300 closed this as completed Sep 1, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

XCM export_message() report errors back to statemine (to release blocked funds) #2443

XCM export_message() report errors back to statemine (to release blocked funds) #2443

EmmanuellNorbertTulbure commented Mar 7, 2023

serban300 commented Mar 23, 2023

bkontur commented Mar 23, 2023

serban300 commented Mar 24, 2023

serban300 commented Mar 27, 2023 •

edited

Loading

serban300 commented Mar 27, 2023

serban300 commented Mar 30, 2023

serban300 commented Mar 30, 2023

serban300 commented Apr 3, 2023 •

edited

Loading

serban300 commented Apr 3, 2023 •

edited

Loading

serban300 commented Apr 4, 2023 •

edited

Loading

serban300 commented Apr 12, 2023

serban300 commented May 10, 2023

EmmanuellNorbertTulbure commented Sep 1, 2023

serban300 commented Sep 1, 2023

XCM export_message() report errors back to statemine (to release blocked funds) #2443

XCM export_message() report errors back to statemine (to release blocked funds) #2443

Comments

EmmanuellNorbertTulbure commented Mar 7, 2023

serban300 commented Mar 23, 2023

bkontur commented Mar 23, 2023

serban300 commented Mar 24, 2023

serban300 commented Mar 27, 2023 • edited Loading

serban300 commented Mar 27, 2023

serban300 commented Mar 30, 2023

serban300 commented Mar 30, 2023

serban300 commented Apr 3, 2023 • edited Loading

serban300 commented Apr 3, 2023 • edited Loading

serban300 commented Apr 4, 2023 • edited Loading

serban300 commented Apr 12, 2023

serban300 commented May 10, 2023

EmmanuellNorbertTulbure commented Sep 1, 2023

serban300 commented Sep 1, 2023

serban300 commented Mar 27, 2023 •

edited

Loading

serban300 commented Apr 3, 2023 •

edited

Loading

serban300 commented Apr 3, 2023 •

edited

Loading

serban300 commented Apr 4, 2023 •

edited

Loading