PITR: Undo of ALTER TABLE #7135

Closed
bmatican opened this issue Feb 5, 2021 · 0 comments
Labels: area/docdb YugabyteDB core features

bmatican commented Feb 5, 2021

Some options we've discussed internally:

  1. As part of the snapshot restore operation, we can explicitly flow the old schema as well, so the TS atomically brings back its RocksDB data AND updates its local schema.
  2. After the master rolls back, it can send explicit RPCs to all affected tables, essentially as an AlterTable operation, to change their local schemas. This could have issues if the master state is behind the TS state: TSs start sending heartbeats before they receive these Alter operations.
  3. Change the TS-side data format to move the schema from the SuperBlock into the RocksDB data itself, as a custom key. This would give us schema rollback for free as part of the RocksDB snapshot restore (see the sketch after this list).
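For illustration only, here is a toy sketch of option 3's idea. It is not YugabyteDB code: a `std::map` stands in for RocksDB, a full copy stands in for a snapshot, and the reserved key name is a hypothetical placeholder. The point it shows is that once the schema lives under a key inside the same store as the data, restoring the data snapshot restores the schema in the same step.

```cpp
// Toy illustration of option 3 above, NOT YugabyteDB code: if the tablet's
// schema lives inside the RocksDB data itself under a reserved key, then a
// RocksDB snapshot restore rolls the schema back together with the data.
// The map, the key name, and the copy-based "snapshot" are hypothetical
// stand-ins chosen only to make the idea concrete.
#include <iostream>
#include <map>
#include <string>

using KeyValueStore = std::map<std::string, std::string>;  // stands in for RocksDB

const std::string kSchemaKey = "metadata/schema";  // hypothetical reserved key

int main() {
  KeyValueStore db = {{kSchemaKey, "schema_v1"}, {"row/1", "a"}};

  // Take a snapshot (modeled as a full copy), then run an ALTER TABLE and
  // write a row under the new schema.
  KeyValueStore snapshot = db;
  db[kSchemaKey] = "schema_v2";
  db["row/2"] = "b";

  // Restoring the snapshot brings back the user data AND the schema key in
  // one step, so no separate schema-rollback mechanism is needed.
  db = snapshot;
  std::cout << db[kSchemaKey] << "\n";  // prints schema_v1
}
```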
bmatican added the area/docdb YugabyteDB core features label Feb 5, 2021
spolitov added a commit that referenced this issue Apr 1, 2021
Summary:
This diff adds logic to restore the table schema. After this, we should be able to undo an ALTER TABLE operation!

There are two important changes as part of this diff.
1) Restoring the master-side sys_catalog metadata.
2) Sending the restored version of the schema from the master to the TS, as part of the explicit command to restore the TS.

As part of applying the restore operation on the master, we add new state tracking, which can diff the current sys_catalog state against the state at the time we want to restore to. This is done by restoring the corresponding sys_catalog snapshot into a temporary directory, with the HybridTime filter applied for the restore_at time. We then load the relevant TABLE and TABLET data into memory and overwrite the existing RocksDB data directly in memory (a minimal sketch of this flow follows the list below). This is safe to do because:
- It is done as part of the apply step of a Raft operation, so it is already persisted and will be replayed accordingly at bootstrap, in case of a restart.
- It is done on both the leader and the followers.
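The sketch below is a minimal standalone model of that master-side flow, not actual YugabyteDB code: all names (EntryVersion, LoadSnapshotAsOf, ApplyRestoredState) are hypothetical, and the HybridTime filter is reduced to "latest version written at or before restore_at" per sys_catalog entry.

```cpp
// Minimal standalone model of the master-side rollback described above,
// NOT actual YugabyteDB code; all names here are hypothetical.
#include <cstdint>
#include <iostream>
#include <map>
#include <string>
#include <vector>

// One version of a sys_catalog row (a TABLE or TABLET entry).
struct EntryVersion {
  std::string payload;   // serialized metadata (schema, partitions, ...)
  uint64_t hybrid_time;  // hybrid time at which this version was written
};

// Snapshot contents: per entry id, its versions ordered by hybrid_time.
using SnapshotHistory = std::map<std::string, std::vector<EntryVersion>>;
// Materialized catalog state: per entry id, one payload.
using SysCatalogState = std::map<std::string, std::string>;

// Model of "restore the snapshot with the HybridTime filter applied for
// restore_at": per entry, keep the latest version written at or before
// restore_at; entries that only exist after restore_at are dropped.
SysCatalogState LoadSnapshotAsOf(const SnapshotHistory& history,
                                 uint64_t restore_at) {
  SysCatalogState result;
  for (const auto& [id, versions] : history) {
    for (const auto& v : versions) {
      if (v.hybrid_time <= restore_at) result[id] = v.payload;  // keeps latest
    }
  }
  return result;
}

// Overwrite the current in-memory catalog state with the restored one. In
// the real system this runs while applying a Raft operation, on the leader
// and the followers, so it is persisted and replayed at bootstrap.
void ApplyRestoredState(SysCatalogState& current,
                        const SysCatalogState& restored) {
  current = restored;
}

int main() {
  SnapshotHistory history = {
      {"table-1", {{"schema_v1", 10}, {"schema_v2_after_alter", 20}}},
  };
  SysCatalogState current = {{"table-1", "schema_v2_after_alter"}};
  ApplyRestoredState(current, LoadSnapshotAsOf(history, /*restore_at=*/15));
  std::cout << current["table-1"] << "\n";  // prints schema_v1
}
```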

Once the master state is rolled back, we then run the TS side of the restore operation. The master now sends the restored schema information as part of the Restore request. On the TS side, we update our tablet schema information on disk accordingly.
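A similarly minimal sketch of the TS-side step follows, again with hypothetical names (RestoredSchema, ApplyRestoredSchema) and with the tablet's on-disk metadata modeled as a plain text file rather than the real SuperBlock. It only illustrates the ordering: persist the schema sent by the master first, then swap the in-memory copy.

```cpp
// Minimal sketch of the TS-side restore step, NOT actual YugabyteDB code.
// The on-disk tablet metadata is modeled as a plain text file; in reality
// this would be the tablet's SuperBlock. All names are hypothetical.
#include <cstdint>
#include <fstream>
#include <iostream>
#include <string>

// Schema information carried in the (hypothetical) Restore request.
struct RestoredSchema {
  std::string schema;       // serialized schema at the restore_at time
  uint32_t schema_version;  // schema version to roll back to
};

// Persist first, then swap in memory, so a crash between the two steps
// still leaves the durable state rolled back.
bool ApplyRestoredSchema(const RestoredSchema& restored,
                         const std::string& metadata_path,
                         std::string* mem_schema, uint32_t* mem_version) {
  std::ofstream out(metadata_path, std::ios::trunc);
  if (!out) return false;
  out << restored.schema_version << "\n" << restored.schema << "\n";
  out.close();
  *mem_schema = restored.schema;
  *mem_version = restored.schema_version;
  return true;
}

int main() {
  std::string schema = "schema_v2";
  uint32_t version = 2;
  ApplyRestoredSchema({"schema_v1", 1}, "tablet_meta.txt", &schema, &version);
  std::cout << "rolled back to schema version " << version << "\n";  // 1
}
```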

Note: Between the master state being rolled back and all the TSs processing their respective restores, there is a time window in which the master can receive heartbeats from a TS with newer schema information than what the master has persisted. Currently, that seems to only lead to some log spew, but it will be investigated later as part of fault tolerance testing.

Test Plan: ybd --gtest_filter SnapshotScheduleTest.RestoreSchema

Reviewers: amitanand, bogdan

Reviewed By: bogdan

Subscribers: ybase

Differential Revision: https://phabricator.dev.yugabyte.com/D11013
YintongMa pushed a commit to YintongMa/yugabyte-db that referenced this issue May 26, 2021