ILM rollover within the same availability zone #62194
Comments
Pinging @elastic/es-core-features (:Core/Features/ILM+SLM)
This is not just applicable to ILM. Users have raised questions about this in the case where you might be patching a node in one AZ and spinning up another node to take its place: copies will come from the primary shard, whichever zone that might be in, rather than from the still-available replica in the local zone.
Relates #63519.
I'm removing the |
Pinging @elastic/es-data-management (Team:Data Management)
I think this duplicates the (resolved) #73496: data transfer to/from snapshots does not incur cross-zone network traffic costs. Therefore I'm closing it.
Is it possible to do ILM rollovers by snapshotting to S3 from a hot node and then recovering on a warm node, instead of a direct node-to-node data transfer? If so, that'd be amazing, and this issue relates. Otherwise, I'm not sure that one duplicates this issue.
Yes, that's right, that's what #73496 does.
Here's an example for an index with one replica, while using force_awareness node attributes to spread the shards across 2x AZs via the zone property. Right now, indexing / ILM are implemented as such:
Indexing:
ILM:
This means there are up to 3x network boundary transitions for the lifecycle of a document 🙀
And you better believe Amazon's making hay while the sun's shining at $0.01 / GB transferred across zones.
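As a rough back-of-the-envelope sketch of what that adds up to, the snippet below multiplies the figures quoted above (up to 3 crossings, $0.01 / GB). The function name and the 1000 GB/day volume are made up for illustration, and real billing may differ.

```python
def cross_zone_transfer_cost(gb_indexed, crossings=3, usd_per_gb=0.01):
    """Estimate the cross-zone transfer cost in USD for a volume of indexed data.

    crossings:  how many times each GB crosses a zone boundary over its
                lifecycle (up to 3, per the description above).
    usd_per_gb: the per-GB rate quoted above; treat it as an assumption.
    """
    return gb_indexed * crossings * usd_per_gb


# Hypothetical volume: 1000 GB indexed per day.
daily_gb = 1000
print(f"${cross_zone_transfer_cost(daily_gb):.2f} per day")
```

If the migration stayed inside the zone, the ILM-related crossings would drop out of the `crossings` count.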
A solution to this might be an added config option to prefer replication on rollover within the same availability zone (where data transfer is free), according to: https://www.elastic.co/guide/en/elasticsearch/reference/7.8/modules-cluster.html#forced-awareness
i.e., ILM changes the index's routing.allocation.require attribute to start the rollover process, followed by the primary and replica shards getting bulk transferred in parallel to nodes with the new attribute, attempting to pick destinations in the same zone they originated from.
Unless something like this is already possible, and if so I'd love to hear about it! Thanks for the consideration!
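To make the proposal concrete, here is a minimal sketch of the "prefer the same zone" selection described above. It is not Elasticsearch allocator code: `pick_same_zone_targets`, the node names, and the zone names are all hypothetical, and it only models the copies of a single shard.

```python
from collections import defaultdict


def pick_same_zone_targets(shard_copies, target_nodes):
    """Map each shard copy to a target node, preferring the copy's own zone.

    shard_copies: list of (copy_name, source_zone) for ONE shard.
    target_nodes: list of (node_name, zone) that match the new attribute.
    Copies of the same shard never share a node, so each node is used once.
    """
    nodes_by_zone = defaultdict(list)
    for node, zone in target_nodes:
        nodes_by_zone[zone].append(node)

    assignment = {}
    used = set()

    # Pass 1: place every copy that has a free target node in its own zone.
    for copy_name, zone in shard_copies:
        for node in nodes_by_zone.get(zone, []):
            if node not in used:
                assignment[copy_name] = node
                used.add(node)
                break

    # Pass 2: remaining copies fall back to any free node (cross-zone copy).
    for copy_name, _zone in shard_copies:
        if copy_name in assignment:
            continue
        free = [node for node, _ in target_nodes if node not in used]
        if not free:
            raise ValueError(f"no target node left for {copy_name}")
        assignment[copy_name] = free[0]
        used.add(free[0])

    return assignment


# Hypothetical layout: one primary and one replica, one warm node per zone.
copies = [("primary", "zone-a"), ("replica", "zone-b")]
warm_nodes = [("warm-1", "zone-b"), ("warm-2", "zone-a")]
print(pick_same_zone_targets(copies, warm_nodes))
```

Running the local pass first means a fallback placement never takes a node that another copy could have reached without leaving its zone.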