-
Notifications
You must be signed in to change notification settings - Fork 8.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Infra Monitoring] Warning about large index sets #120615
Comments
Pinging @elastic/infra-monitoring-ui (Team:Infra Monitoring UI) |
The original intent of https://www.elastic.co/guide/en/elasticsearch/reference/master/keyword.html#constant-keyword-field-type was to allow broad searches (like Granted, customer indices might not always these keywords set. This has me curious about general query timing information for the user. Something like:
It's a much bigger ask than your description of course. But maybe useful food for thought. |
@matschaffer That sounds very much like another idea @weltenwort had, about adding some kind of app specific profiling/diagnostics tool |
For some background information, we're mostly seeing problems with the alert query execution times. We've been discussing this in #98010 for a while and implemented partial optimizations. The log stream itself has shown pretty stable performance since its queries don't include any expensive aggregations and consistently apply time range filters. In in light of that, I wonder what we intend to solve with this issue? |
Upon some reflection I think this will be solved by us being able to offer curated views where the user can easily narrow down their index set to only show data for that view (and use that for the alerts they create too). If we can get Integrations to install these then it's all the easier. |
WIP
One of the factors that impact query performance is the number of indices targeted. Elasticsearch can narrow down this set based on if the index has any documents within the specified time range but if you have a large set of indices and they all have documents in the given time range Elasticsearch has to hit all of them, since it cannot know if an index has relevant data or not.
However, the user might know based on for example which filebeat modules they are using.
It would be good if we could surface feedback to the user, saying something along the lines of "the index pattern
filebeat-*
includes a lot of indices, do you want to narrow it down tofilebeat-{module}-*
?" or similar.In the Logs UI we have started adding some kind of validation of the Data View being selected for the Source Configuration, this could be a good area to build further on since today the Rule Type executors are affected by growing sets of indices to scan.
How does this all relate to the can match phase which should account for "presence of fields, value ranges and constant_keywords"
Note: This is likely to become more relevant when we can offer "saved views" to give users options to place narrow targets in different views in the same space
The text was updated successfully, but these errors were encountered: