
Refactor quality metrics tests to use fixture #3249

Merged
merged 10 commits on Sep 12, 2024

Conversation

@chrishalcrow (Collaborator) commented Jul 24, 2024

Simplify the quality metrics tests so that they mostly share two sorting_analyzers, and wrap all the tests in main(), which may be needed for CI multiprocessing(?).

This PR moves the sorting_analyzer_simple fixture to conftest.py, which means it can be shared by all the quality metric test files (=> it only has to be computed once). Moreover, we weren't using the PCs computed for this sorting_analyzer, so I got rid of that computation.
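For context, a minimal sketch of the shared-fixture idea; the generator arguments and compute steps below are illustrative, not copied from the PR:

    # conftest.py (sketch) — shared by all quality metric test files.
    import pytest
    from spikeinterface.core import generate_ground_truth_recording, create_sorting_analyzer

    @pytest.fixture(scope="session")
    def sorting_analyzer_simple():
        # Built once per test session and reused, instead of being
        # recomputed in every test file.
        recording, sorting = generate_ground_truth_recording(durations=[50.0], num_units=10, seed=2205)
        analyzer = create_sorting_analyzer(sorting, recording, format="memory")
        analyzer.compute("random_spikes")
        analyzer.compute("waveforms")
        analyzer.compute("templates")
        analyzer.compute("noise_levels")
        # Note: no "principal_components" step — the PCs were unused here.
        return analyzer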

On my machine, the old tests took 3 mins 17 sec and the new ones 2 mins 5 sec. Note: the tests are basically the same.

Results from gh actions with n_jobs=2 (https://github.com/SpikeInterface/spikeinterface/actions/runs/10075290248):
mac3.9: 2:22
mac3.12: 2:17
win3.9: 2:09
win3.12: 2:10
ubu3.9: 2:24
ubu3.12: 2:32

Compared to old timing (https://github.com/SpikeInterface/spikeinterface/actions/runs/10063489408):
mac3.9: 2:39
mac3.12: 2:46
win3.9: 9:17
win3.12: 9:09
ubu3.9: 3:23
ubu3.12: 3:08

So: it definitely helps the Windows runs! We should keep checking to see whether this test time is consistent across many runs.

@chrishalcrow chrishalcrow added qualitymetrics Related to qualitymetrics module testing Related to test routines performance Performance issues/improvements labels Jul 24, 2024
@chrishalcrow chrishalcrow marked this pull request as ready for review July 24, 2024 11:22
@@ -570,27 +540,36 @@ def test_calculate_sd_ratio(sorting_analyzer_simple):

if __name__ == "__main__":
Collaborator:

I agree, I think this is essential for certain Windows runs. But it is super inconsistent which Windows setups need it (for example, I haven't needed the if __name__ == "__main__" guard yet, but others have...). So weird.

Collaborator:

No way! 🤯 So it depends on the Windows version?

Collaborator:

I think it is less "version" and more like version + sub-version + Python version + CPU type + IDE. Who knows what magic determines this... But one of my co-workers has to use the if __name__ == "__main__" guard, and I don't on my workstation.

Collaborator:

I had responded but it somehow disappeared: "Haha lol, the mysteries of multiprocessing, very interesting!"
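For context on the thread above: on Windows, multiprocessing uses the "spawn" start method, which re-imports the main module in each worker process, so any module-level code runs again in every child. A minimal illustration of why the guard matters (hypothetical file, not from the PR):

    # spawn_demo.py — illustrative only.
    from concurrent.futures import ProcessPoolExecutor

    def square(x):
        return x * x

    if __name__ == "__main__":
        # With "spawn", each worker re-imports this module; without the
        # guard, the pool creation would re-run on import and recurse.
        with ProcessPoolExecutor(max_workers=2) as pool:
            print(list(pool.map(square, range(4))))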

@JoeZiminski (Collaborator) left a comment:

Hey @chrishalcrow, this is really nice, great to centralise these fixtures, and to find out some interesting things about pytest and multiprocessing along the way!

(Four resolved review comments on src/spikeinterface/qualitymetrics/tests/conftest.py, now outdated.)
@zm711 (Collaborator) commented Jul 25, 2024

@chrishalcrow,

Why are we dropping the PC calculation? I need to look at what the actual test is checking, but without PCs calculated we wouldn't be able to test the PC metrics.

I will say that, in general, on my Windows machine the PCA is the slowest step. So maybe the scipy implementation on Windows is the thing that is so variable. Removing the PCs got rid of that variability, but I wonder if we need that calculation.

@chrishalcrow (Collaborator, Author) replied:

> Why are we dropping the PC calculation? [...] Removing the PCs got rid of that variability, but I wonder if we need that calculation.

We do still test them in test_calculate_pc_metrics, using the small_sorting_analyzer rather than the sorting_analyzer_simple.

The tests are a bit of a hodgepodge, but the aim of this PR was just to try to get the times consistent across OSs and to simplify the structure.
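For reference, a rough sketch of how the PC-based metrics can still be exercised on the small analyzer (the extension and metric names follow the spikeinterface API, but the exact test body here is hypothetical):

    # Sketch: compute PCs on the small analyzer, then the PC-based metrics.
    def test_pc_metrics_sketch(small_sorting_analyzer):
        small_sorting_analyzer.compute("principal_components", n_components=3, mode="by_channel_local")
        qm = small_sorting_analyzer.compute("quality_metrics", metric_names=["isolation_distance", "l_ratio"])
        # The extension's data is a DataFrame of per-unit metric values.
        assert not qm.get_data().empty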

@zm711 (Collaborator) commented Jul 29, 2024

Then my guess is that the PCA is causing the time variability. If we aren't testing it here, then removing it gets us the speed-up (again, my guess). You could add it back in, see if the tests slow down, and then remove it before the actual merge.

@alejoe91 alejoe91 added this to the 0.101.1 milestone Aug 27, 2024
@JoeZiminski (Collaborator):

Hey @chrishalcrow @zm711, are we almost done here? (Sorry @zm711, I accidentally clicked to re-request your review when I meant to click myself.) I guess we can continue the code-comment discussion, but maybe it is worth porting it to an issue.

Otherwise, it's just to see if reintroducing the PC metric calculation in the sorting analyzer fixture brings the test time back up, to confirm that it was the culprit.

@zm711 (Collaborator) commented Sep 10, 2024

What is the actual change here? Deleting Sam's/Alessio's testing is +/-, plus the removal of PCA. I think I would actually prefer to check whether PCA is the culprit here, rather than removing it and never really understanding why the tests got shorter.

@JoeZiminski (Collaborator):

Yes, good point; the debug code changes can be reverted, so it's worth summarising. To summarise (let me know @chrishalcrow if I missed anything):

  1. debug lines deleted (let's revert, but +1 to keep them commented; will message above)
  2. get_sorting_analyzer() factored into conftest.py to avoid duplication (+1)
  3. unused PCA metric calculation removed (let's see if that caused the slowdown, but even if it didn't, +1 to remove it as it's unused)

Comment on lines 571 to 578

    if __name__ == "__main__":

        sorting_analyzer = _sorting_analyzer_simple()
        print(sorting_analyzer)

        test_unit_structure_in_output(_small_sorting_analyzer())

        # test_calculate_firing_rate_num_spikes(sorting_analyzer)
Member:

@chrishalcrow can we add this back so @samuelgarcia is happy? :)

@chrishalcrow (Collaborator, Author):

Hoping this is ready if tests pass

@JoeZiminski (Collaborator):

Looks good @chrishalcrow! One last question from above: would you mind adding the PC metrics back into the test, just to see if it accounts for the change in test times (just out of interest)? Then we can remove them again and it's good to merge, I think. @zm711 may want a final pass.

@zm711 (Collaborator) commented Sep 11, 2024

This looks good to me. I would love it if you would just test adding the PCA back so we can assess whether that is the timing issue, but I could also do that in a test PR if it is too annoying. Otherwise this is fine by me. I would recommend changing the title of the PR so it won't be misleading in the changelog, but otherwise it's good by me.

@zm711 (Collaborator) commented Sep 11, 2024

Adding that back took the Windows tests from 2-4 minutes to 16-17 minutes. So my guess is that the PCA stuff is not optimized for Windows. That's really good to know, because my lab hates running PCA since it takes forever, and this confirms it on Windows.

@chrishalcrow chrishalcrow changed the title Simplify quality metrics tests and wrap for multiprocessing Refactor quality metrics tests to use fixture Sep 12, 2024
@zm711 (Collaborator) left a comment:

Good by me :)

@alejoe91 alejoe91 merged commit 3d725a4 into SpikeInterface:main Sep 12, 2024
15 checks passed