Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Include metric types in tsdb index and template mappings #316

Conversation

salvatore-campagna
Copy link
Contributor

@salvatore-campagna salvatore-campagna commented Sep 7, 2022

This PR includes the following changes:
1. extract the mappings which are common between the index and the data stream template (see index-mappings.json)
2. include the metric type for (almost) every field (skipped histogram fields like latencies and others)
3. fix some inconsistency around synthetic source settings
4. include a new downsampling challenge in the tsdb track (create an index, downsample it using 3 different intervals and run 3 date histogram aggregations on the downsampled indices).

NOTE: the downsample challenge uses a new operation (downsample) which is required to run the challenge (see Rally PR 1574).

This commit includes three changes:
1. extract the mappings which are common between the
   index and the data stream template
2. include the metric type for (almost) every field
   (skipped histogram fields like latencies and others)
3. fix some inconsistency around synthetic source settings
},
{
"operation": "date-histo-entire-range-1m",
Copy link
Contributor Author

@salvatore-campagna salvatore-campagna Sep 7, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am wondering if it makes sense to have these three aggregations here or if we want to just have them under an "aggs" challenge for the tsdb track (not part of the downsampling challenge). Maybe we can keep them until we have an "aggs" challenge for the tsdb track?

Copy link
Member

@nik9000 nik9000 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this looks good. We probably need the new rally operator first. But once we have it I'd be happy to get this run nightly as well.

tsdb/challenges/default.json Show resolved Hide resolved
@nik9000
Copy link
Member

nik9000 commented Sep 7, 2022

It'll want a review from a perf person as well!

"name": "date-histo-entire-range-1d",
"operation-type": "search",
"index": "tsdb-1h",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

did you intend

Suggested change
"index": "tsdb-1h",
"index": "tsdb-1d",

?

Copy link
Contributor

@DJRickyB DJRickyB left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This LGTM

Copy link
Member

@martijnvg martijnvg left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍

"date": {
"date_histogram": {
"field": "@timestamp",
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just a question, do we also want to benchmark the time_series agg here? (as separate operation)

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

My plan is to have later a new "aggs" challenge in the tsdb track to benchmark aggregations. Those aggregations are there just to roughly evaluate the result of downsampling and I might remove them later once we have the new "aggs" challenge.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants