
Make cluster recovery near instantaneous if all shards are present and accounted for #6069

Closed
geekpete opened this issue May 6, 2014 · 16 comments

geekpete (Member) commented May 6, 2014

When restarting a cluster from a green state, each shard appears to undergo some form of checksum verification before being brought back online.

Is there a way to journal writes so that recovery is much faster, the way the XFS filesystem does it?

Only the data that was being written at the time of the outage or shutdown would need to be reviewed, so just the in-progress writes get checked.

For a clean shutdown, maybe a full cluster restart command could tell all nodes to shut down in a clean state and then power off, allowing near-instantaneous recovery on startup: for example, stop allocation, flush all translogs, then shut down, etc.
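Something like the following, as a rough sketch against the Elasticsearch REST API (the cluster-settings and flush endpoints are real; the host and the surrounding orchestration are assumptions):

```python
# Sketch of the proposed clean-restart sequence, driven through the
# Elasticsearch REST API. Assumes a node reachable at localhost:9200.
import requests

ES = "http://localhost:9200"

# 1. Stop the cluster from reallocating shards while nodes go down.
requests.put(f"{ES}/_cluster/settings",
             json={"transient": {"cluster.routing.allocation.enable": "none"}})

# 2. Flush all translogs so every shard is on disk in a clean state.
requests.post(f"{ES}/_flush")

# 3. Shut the nodes down here; after they come back, re-enable allocation.
requests.put(f"{ES}/_cluster/settings",
             json={"transient": {"cluster.routing.allocation.enable": "all"}})
```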

It would take a lot of the pain out of cluster restarts.

Just an idea

nik9000 (Member) commented Jul 17, 2014

@javanna, when I met you in Germany we talked about doing something about the slow recovery times. I'm wondering if there is anything I can do to help with that.

s1monw self-assigned this Jul 17, 2014

s1monw (Contributor) commented Jul 17, 2014

@nik9000 we have improvements in the pipeline for this. I can't promise when we will start working on them or when they will land, but what we essentially plan is to work out the algorithmic parts needed to reduce the risk of a full recovery from the primary shard even when replicas are out of sync by just a handful of documents. I will try to update this issue once I have news. Thanks for pinging!
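In concrete terms, the idea is operation-based catch-up: if a replica tracks how far it has applied the primary's operation log, recovery can replay just the missing tail instead of copying every segment file. A hypothetical sketch (all names, like `replica_checkpoint`, are illustrative, not Elasticsearch internals):

```python
# Hypothetical sketch of operation-based catch-up recovery: a replica
# that is only a handful of documents behind replays the missing tail
# of the primary's operation log instead of copying all segment files.

def recover_replica(primary_ops, replica_checkpoint, apply_op):
    """primary_ops: list of (seq_no, op) pairs sorted by seq_no.
    replica_checkpoint: highest seq_no the replica has already applied."""
    missing = [op for seq_no, op in primary_ops if seq_no > replica_checkpoint]
    for op in missing:
        apply_op(op)  # replay only the small diff
    return len(missing)  # 0 means the shard was already in sync

# Example: the replica is two operations behind and replays ops 3 and 4.
ops = [(1, "index A"), (2, "index B"), (3, "delete A"), (4, "index C")]
print(recover_replica(ops, replica_checkpoint=2, apply_op=print))  # -> 2
```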

nik9000 (Member) commented Jul 17, 2014

Thanks for the reply! I'm happy to help work on it but I imagine it'd be faster to just have someone familiar with your ideas do it.

nik9000 (Member) commented Jul 29, 2014

I'm feeling this pain again today while doing a rolling restart to pick up a plugin, and I'll feel it again in two weeks when I'm ready to upgrade to 1.3.1. Because the restart process is so slow, I try to batch changes that get picked up by the restart, but that isn't great from a "change one thing at a time" perspective.

nik9000 (Member) commented Aug 23, 2014

I figure I should poke this issue every time I do a full-day cluster restart. Poke. I'm happy to work on this if someone who has thought more about it can share their ideas. At this point, sinking a couple of weeks into speeding up cluster restarts would save me time in the long run.

clintongormley commented

Hi @nik9000. This improvement depends on the addition of "sequence numbers" (a feature that will enable a number of other improvements). We are currently experimenting with various approaches, but rest assured, this issue is not being ignored.

nik9000 (Member) commented Nov 12, 2014

@clintongormley since you poked me last night about outstanding work I planned to do, can I poke this one? I'd love to have this. I'm in the middle of a rolling cluster restart that is taking two days. It's thankfully quite boring, but it still requires some degree of babysitting.

clintongormley commented

@nik9000 we are working on the design for this one. It is in the top 3 on our list, but it is obviously complicated and not guaranteed. We'll update the issue as soon as we have more news.

geekpete (Member, Author) commented

This near-instantaneous recovery idea might also apply when a node drops out of the cluster but rejoins with all its on-disk data still intact.

Instead of throwing all that data away, if it could be salvaged efficiently so that only the outdated differences need to be transmitted, it would save quite a lot of data transfer on large clusters and make failures recoverable in much shorter time periods.
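One way to picture the salvage step: compare per-file checksums of the segments already on the rejoining node against the primary's, and transfer only the files that are missing or differ. A hypothetical sketch (the function and data shapes are illustrative, not Elasticsearch internals):

```python
# Hypothetical sketch of segment reuse on rejoin: instead of discarding
# the rejoining node's on-disk data, compare per-file checksums against
# the primary and transfer only the files that actually differ.

def files_to_copy(primary_files, local_files):
    """Both arguments map file name -> checksum."""
    return [
        name for name, checksum in primary_files.items()
        if local_files.get(name) != checksum  # missing or outdated
    ]

primary = {"_0.cfs": "a1", "_1.cfs": "b2", "_2.cfs": "c3"}
local   = {"_0.cfs": "a1", "_1.cfs": "b2"}   # node was down while _2 was written
print(files_to_copy(primary, local))          # -> ['_2.cfs']
```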

bleskes (Contributor) commented Nov 20, 2014

@geekpete yeah - the plan is to help with that as well, at least when the downtime is planned. When it isn't planned, things get slightly trickier, as ES will start replicating as soon as a node goes down - there is no way for it to know how long the downtime will last.
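For reference, later Elasticsearch releases added a knob for exactly this: a per-index delay before shards from a departed node are reallocated. A minimal sketch, assuming a node at localhost:9200 (this setting did not yet exist at the time of this comment):

```python
# Sketch: delay shard reallocation after a node leaves, so a quick
# planned restart does not trigger a full re-replication. The
# delayed_timeout setting was added in later Elasticsearch releases.
import requests

requests.put("http://localhost:9200/_all/_settings",
             json={"index.unassigned.node_left.delayed_timeout": "5m"})
```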

bobrik (Contributor) commented Dec 1, 2014

This is probably related to #8725

connieyang commented

We have an Elasticsearch cluster (as part of our ELK stack) in production and have experienced long waits (6+ hours) for the cluster to turn green during a rolling restart. When will a fix for this be ready?

bleskes (Contributor) commented Jan 20, 2015

@connieyang sorry to hear about the pain. We are actively working on it. Sadly, I can't promise an ETA at the moment.

cfeio commented Feb 17, 2015

+1, we are also experiencing pain with cluster recovery due to our large cluster size (6 TB). It takes hours to recover even after a planned maintenance restart. This feature would be a huge improvement!

shyem commented Feb 19, 2015

+1, same here. With daily indices, most of them are unchanged, yet it takes forever to recover.

clintongormley commented

Closed by #11336
