CodSpeed reporting wrong numbers #82

overlookmotel · 2024-07-16T12:14:37Z

The problem

I've seen quite a few times recently that the benchmark results CodSpeed reports on PRs are wrong.

This is a problem when doing perf work, as you can't tell if what you're doing is good or not.

e.g. PR oxc-project/oxc#4214 initially showed 0 speed-up, but then the benchmarks re-ran after the PR below it in the stack was merged, and suddenly it showed a 6% perf improvement. https://codspeed.io/oxc-project/oxc/branches/07-12-perf_semantic_reduce_lookups

That's wrong. The PR gives 0 perf improvement.

The reason was that in the last run, CodSpeed compared against the commit 2 back (3016f03) rather than 1 back. So the 6% result shown included the perf boost from oxc-project/oxc#4213, which is the commit that preceded it.

Why?

I am not sure why this has started happening recently. Could be:

Changes at CodSpeed's end.
Caused by our switch to using Graphite merge queue.

Solutions

Raise with CodSpeed.
If they can't fix, investigate if we can handle it somehow at our end.

Because we intercept and store bench results and upload them to CodSpeed from our end, we could potentially get our GitHub action to check that benchmarks for the previous commit have completed and been uploaded to CodSpeed before submitting results for the current commit. If they haven't, wait until they have.
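A minimal sketch of that "wait for the previous commit first" check. Everything here is an assumption for illustration: `base_results_uploaded` is a hypothetical callback standing in for however we would actually find out that the previous commit's results are in (nothing in this thread says how CodSpeed exposes that), and the timeout and poll interval are arbitrary defaults.

```python
import time
from typing import Callable


def wait_for_base_results(
    base_sha: str,
    base_results_uploaded: Callable[[str], bool],  # hypothetical check, supplied by the caller
    timeout_s: float = 1800.0,                     # arbitrary default
    poll_interval_s: float = 30.0,                 # arbitrary default
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> bool:
    """Block until results for `base_sha` are reported as uploaded.

    Returns True once the check passes, or False if the timeout
    expires first, so the caller can decide whether to upload anyway.
    """
    deadline = clock() + timeout_s
    while True:
        if base_results_uploaded(base_sha):
            return True
        if clock() >= deadline:
            return False
        sleep(poll_interval_s)
```

The upload step would call this with the previous commit's SHA and only submit the current commit's results once it returns. Returning False on timeout, rather than raising, leaves it open whether a stuck base run should block the upload or just log a warning.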

overlookmotel · 2024-07-26T11:11:24Z

Appears to be a CodSpeed thing, unrelated to Graphite merge queue.

Another example from today: oxc-project/oxc#4476
https://codspeed.io/oxc-project/oxc/branches/07-26-perf_sourcemap_pre_allocate_string_buf_while_encoding

CodSpeed gets "stuck" comparing to "base" commit ccb1835, even though the PR was rebased on latest main, so the merge commit that the benchmarks for that PR run on is based on latest main, 42a2519.

overlookmotel · 2024-07-26T11:47:20Z

Have posted on CodSpeed support Discord: https://discord.com/channels/1065233827569598464/1065686090452828251/threads/1266360428930535526

Boshen · 2024-07-27T01:12:50Z

The conclusion is that the base should always trigger a benchmark run on any Rust change. This case was a change in rustc version.

overlookmotel · 2024-07-29T17:11:34Z

I've moved this back to backlog and reopened.

I didn't draw quite the same conclusion from this as Boshen. It seems to me that the problem is a race condition - if a PR completes its benchmark run before the latest commit on main completes its own, then CodSpeed uses the previous commit on main as the base for comparison.
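To make the suspected race concrete, here is a toy model of it. This is only my reading of the behaviour, not CodSpeed's actual logic: it assumes the comparison base is the newest commit on main that already has results. The commit names are placeholders.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class MainCommit:
    sha: str
    results_ready: bool  # has this commit's benchmark run finished and been uploaded?


def pick_comparison_base(main_history: List[MainCommit]) -> Optional[str]:
    """Assumed selection rule: newest commit on main that has results.

    `main_history` is ordered newest first. Returns None if no commit
    on main has results yet.
    """
    for commit in main_history:
        if commit.results_ready:
            return commit.sha
    return None


# The PR is based on "parent", but parent's benchmarks are still running,
# so under this rule the comparison falls back to "grandparent".
history = [
    MainCommit("parent", results_ready=False),
    MainCommit("grandparent", results_ready=True),
]
base = pick_comparison_base(history)
```

Under this model the PR's reported change would also include whatever "parent" changed relative to "grandparent", which matches the symptom in the first example.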

We could fix this by adding a check to the upload action to make sure benchmarks aren't still running on the base commit. If they are, wait for them to finish before uploading. As we're already handling uploading benchmarks ourselves for the purpose of sharding, we can handle this quite easily.

I think this problem has become more common recently because our use of Graphite merge queue means it's constantly merging stuff and running benchmarks in quick succession.

overlookmotel mentioned this issue Jul 26, 2024

perf(mangler): reduce unnecessary allocation oxc-project/oxc#4477

Closed

overlookmotel transferred this issue from oxc-project/backlog Jul 26, 2024

overlookmotel changed the title ~~Investigate CodSpeed reporting wrong numbers~~ CodSpeed reporting wrong numbers Jul 26, 2024

Boshen closed this as not planned Jul 27, 2024

overlookmotel transferred this issue from oxc-project/oxc Jul 29, 2024

overlookmotel reopened this Jul 29, 2024
