Add distopia! #3914

hmacdope · 2022-11-10T00:18:14Z

Following the bump of the conda-forge feedstock of distopia to 0.2.0, which includes the in-place API, we can now integrate distopia into MDAnalysis.

Note that currently only calc_bonds is implemented.
Note that a kludge is required to convert the input array to the right datatype.

Changes made in this Pull Request:

Adds distopia to the MDA distances library.

PR Checklist

Tests?
Docs?
CHANGELOG updated?
Issue raised/referenced?

codecov · 2022-11-10T00:35:06Z

Codecov Report

Base: 93.52% // Head: 93.53% // Increases project coverage by +0.00% 🎉

Coverage data is based on head (04bb71b) compared to base (94904b5).
Patch coverage: 100.00% of modified lines in pull request are covered.

Additional details and impacted files

@@           Coverage Diff            @@
##           develop    #3914   +/-   ##
========================================
  Coverage    93.52%   93.53%           
========================================
  Files          190      191    +1     
  Lines        25037    25062   +25     
  Branches      3543     3547    +4     
========================================
+ Hits         23417    23442   +25     
  Misses        1099     1099           
  Partials       521      521

Impacted Files	Coverage Δ
package/MDAnalysis/lib/_distopia.py	`100.00% <100.00%> (ø)`
package/MDAnalysis/lib/distances.py	`97.68% <100.00%> (+0.06%)`	⬆️


orbeckst · 2022-11-11T21:24:50Z

Do we have ASV benchmarks that could show the before/after?

hmacdope · 2022-11-11T22:10:26Z

Do we have ASV benchmarks that could show the before/after?

I'll run them today. :)

orbeckst · 2022-11-11T22:14:45Z

I was thinking more about having a benchmark for calc_bonds and friends added to benchmarks in a separate PR. Then this PR can extend them.

IAlibay

I assume this is in the works since it's WIP, but since discussion is happening:

Needs:

A CI runner (can just use extra conda deps argument to add a py3.10 runner with distopia)
- My view is that we don't need to do a full sweep here, distopia already tests relevant OS/Python versions, so it shouldn't be needed here.
Docs

.github/actions/setup-deps/action.yaml

azure-pipelines.yml

maintainer/conda/environment.yml

hmacdope · 2022-11-13T22:46:52Z

I was thinking more about having a benchmark for calc_bonds and friends added to benchmarks in a separate PR. Then this PR can extend them.

@orbeckst there is a benchmark for calc_bonds, but only handles the no-box case. I think there is an open PR to add PBC based distance benchmarks in #3475.
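For reference, a benchmark covering both the no-box and the box case could look roughly like the sketch below. It follows the ASV convention of a class with `params`, `param_names`, `setup` and `time_*` methods. Everything named here is an assumption for illustration: `reference_calc_bonds` is a NumPy stand-in so the sketch runs without MDAnalysis installed, and `CalcBondsBench` is not the benchmark class in the repository or in #3475.

```python
import numpy as np


def reference_calc_bonds(coords1, coords2, box=None):
    """NumPy stand-in for calc_bonds (orthorhombic box only); hypothetical."""
    delta = coords1 - coords2
    if box is not None:
        # minimum-image convention for an orthorhombic box
        delta -= box * np.round(delta / box)
    return np.sqrt((delta * delta).sum(axis=1))


class CalcBondsBench:
    """ASV-style benchmark: asv calls setup() and then times each time_* method."""

    # one benchmark run per combination of these parameters
    params = ([100, 10_000], [False, True])
    param_names = ["n_pairs", "use_box"]

    def setup(self, n_pairs, use_box):
        rng = np.random.default_rng(0)
        self.coords1 = (rng.random((n_pairs, 3)) * 10).astype(np.float32)
        self.coords2 = (rng.random((n_pairs, 3)) * 10).astype(np.float32)
        self.box = (
            np.array([10.0, 10.0, 10.0], dtype=np.float32) if use_box else None
        )

    def time_calc_bonds(self, n_pairs, use_box):
        reference_calc_bonds(self.coords1, self.coords2, box=self.box)
```

In a real benchmark the stand-in would be swapped for the library call, with the backend as an additional parameter so that before/after timings appear side by side.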

hmacdope · 2022-11-13T22:53:09Z

See the discussion on the distopia issue about whether it should be a drop-in replacement or a selectable backend.

hmacdope · 2022-11-21T03:10:23Z

@richardjgowers, @orbeckst I have run some timings comparing distopia, MDA bindings to distopia and also to MDTraj/freud.

First thing to clear up: Richard and I discussed checking the overhead of the _run() backend selector function. In my benchmarks it had almost no effect, so the rest of the benchmarks were run through _run().
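To illustrate why a selector of this kind adds so little overhead, here is a minimal sketch of a name-based dispatcher. It is not the MDAnalysis implementation: `run`, `_BACKENDS` and `_serial_calc_bonds` are hypothetical names, and the "serial" kernel is a NumPy placeholder. The only work done per call is a dictionary lookup and one extra function call.

```python
import numpy as np


def _serial_calc_bonds(coords1, coords2, results):
    """Placeholder 'serial' kernel that fills results in place."""
    results[:] = np.linalg.norm(coords1 - coords2, axis=1)
    return results


# Hypothetical registry mapping backend name -> {function name -> callable}.
_BACKENDS = {"serial": {"calc_bonds": _serial_calc_bonds}}


def run(funcname, args=(), kwargs=None, backend="serial"):
    """Look up funcname in the chosen backend and call it."""
    kwargs = {} if kwargs is None else kwargs
    try:
        func = _BACKENDS[backend.lower()][funcname]
    except KeyError:
        raise ValueError(
            f"unknown backend/function: {backend!r}/{funcname!r}"
        ) from None
    return func(*args, **kwargs)


a = np.zeros((4, 3))
b = np.ones((4, 3))
out = np.empty(4)
res = run("calc_bonds", args=(a, b), kwargs={"results": out})
```

Because the kernel writes into `results` and returns it, `res` and `out` refer to the same array here.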

TLDR of this whole thing:

1. Our C++ layer is inherently slow, probably due to problem 2.
2. Our use of mixed precision, and the various casting that needs to take place, makes everything sluggish.

Here is a comparison showing the time required to calculate N distances (x axis) while fully complying with our API, which takes float coordinates and returns double. As distopia doesn't allow mixed precision, this requires that we cast the results buffer to float32 and then upcast the result to float64, with something that looks like this:

                    # need explicit branch on backend to prepare input types correctly
                    # must assign to result to get correct reference on memview
                    bondlengths = _run("calc_bonds_ortho_float",
                         args=(coords1, coords2, box[:3]),
                         kwargs={'results': bondlengths.astype(np.float32)},
                         backend=backend)
                    # upcast is currently required
                    bondlengths = bondlengths.astype(np.float64)

We can see that the MDA distopia bindings (purple) are far off their theoretical peak (blue), primarily due to the input and output casting required.
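The cast round-trip can be reproduced in isolation with the sketch below. It does not call distopia: `float32_kernel` is a NumPy stand-in for a single-precision in-place kernel, and `calc_bonds_double_out` is a hypothetical wrapper name. The point is that `astype` allocates and copies on the way in and again on the way out, so two extra passes over the data surround the actual computation.

```python
import numpy as np


def float32_kernel(coords1, coords2, results):
    """Stand-in for a single-precision in-place kernel (not the distopia API)."""
    if results.dtype != np.float32:
        raise TypeError("results must be float32")
    results[:] = np.sqrt(((coords1 - coords2) ** 2).sum(axis=1))
    return results


def calc_bonds_double_out(coords1, coords2):
    """Keep a float64 return type around a float32-only kernel."""
    n = coords1.shape[0]
    bondlengths = np.empty(n, dtype=np.float64)  # buffer in the promised dtype
    # astype creates a new float32 array, so the kernel's return value must be
    # kept; the original float64 buffer is never written to.
    tmp = float32_kernel(
        coords1, coords2, results=bondlengths.astype(np.float32)
    )
    # The upcast allocates and copies again: the second extra pass.
    return tmp.astype(np.float64)


rng = np.random.default_rng(1)
c1 = rng.random((1000, 3)).astype(np.float32)
c2 = rng.random((1000, 3)).astype(np.float32)
dists = calc_bonds_double_out(c1, c2)
```

Timing `calc_bonds_double_out` against a direct call to `float32_kernel` with a preallocated float32 buffer gives a rough estimate of the casting overhead on a given machine.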

What happens if we remove the output cast, which is an API break for MDA (it guarantees double output)?

We can see that the performance is a lot better.

While we cannot remove the input cast, because distopia requires consistent precision, we can subtract its approximate overhead.

With overhead of casting approximately removed (brown line) we are very close to the distopia-only maximum speed.

Hopefully this all makes sense.

My main proposal is that we remove the guarantee that results are returned as doubles in 3.0. If people are amenable to this, we can continue to discuss in a new issue?
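As a rough illustration of what single-precision results would mean for downstream comparisons, the sketch below computes the same distances in float32 and float64 with plain NumPy (no MDAnalysis or distopia calls; the `distances` helper is hypothetical). Exact equality between the two is not guaranteed, but the values agree within a relative tolerance, which is why tolerance-based assertions are the safer choice in tests.

```python
import numpy as np

rng = np.random.default_rng(42)
coords1 = (rng.random((1000, 3)) * 10).astype(np.float32)
coords2 = (rng.random((1000, 3)) * 10).astype(np.float32)


def distances(a, b, dtype):
    """Row-wise distances computed entirely in the requested precision."""
    delta = a.astype(dtype) - b.astype(dtype)
    return np.sqrt((delta * delta).sum(axis=1))


d32 = distances(coords1, coords2, np.float32)
d64 = distances(coords1, coords2, np.float64)

# Largest relative deviation between the two precisions
max_rel = np.max(np.abs(d32.astype(np.float64) - d64) / d64)
print(f"max relative difference: {max_rel:.2e}")
print("allclose at rtol=1e-5:", np.allclose(d32, d64, rtol=1e-5))
```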

richardjgowers

LGTM. Looking forward to going single precision in 3.0 for these reasons

hmacdope · 2022-11-21T11:07:40Z

LGTM. Looking forward to going single precision in 3.0 for these reasons

Issue raised #3927

Co-authored-by: Irfan Alibay <IAlibay@users.noreply.github.com>

hmacdope · 2023-01-08T02:03:01Z

Hang on, there are some issues with passing through args and returning the right thing.

IAlibay

Some initial comments.

Btw, we don't have to follow the black part of darker; it's more of an aid than anything else.

package/MDAnalysis/lib/distances.py

testsuite/MDAnalysisTests/lib/test_distances.py

Co-authored-by: Irfan Alibay <IAlibay@users.noreply.github.com>

package/MDAnalysis/lib/distances.py

hmacdope · 2023-01-20T03:45:23Z

I think this should be good for re-review

IAlibay

aside from the one thing that I'm not sure if it got resolved - lgtm! (sorry about the delay here)

Hugo Macdermott and others added 4 commits November 4, 2022 17:55

add starrt of calc_bonds

28bf2f9

refactor to use distopia inplace API

280126e

add in upcast

8a571d4

actually return the distances

d6c9220

github-actions bot added the Component-lib label Nov 10, 2022

hmacdope self-assigned this Nov 10, 2022

hmacdope added the CZI-performance performance track of CZIEOSS4 grant label Nov 10, 2022

hmacdope mentioned this pull request Nov 10, 2022

[Discuss, dont close] Precision of distopia functions may vary with architecture #3915

Open

IAlibay requested changes Nov 11, 2022

hmacdope added 2 commits November 12, 2022 19:53

distopia to ci

dae07fa

add initial docs

f58b8e1

github-actions bot added the Continuous Integration label Nov 12, 2022

IAlibay requested changes Nov 12, 2022

.github/actions/setup-deps/action.yaml

azure-pipelines.yml (outdated)

IAlibay requested changes Nov 12, 2022

maintainer/conda/environment.yml (outdated)

hmacdope added 3 commits November 21, 2022 10:54

move to using _run backend

07016f5

add upcast back in

908f6fe

fix yml CI files

072b743

github-actions bot removed the Continuous Integration label Nov 21, 2022

hmacdope added 2 commits November 21, 2022 14:20

change test_pairwise_dist to use allclose rather than equals

05fe804

change another test to sue assert allclose

1d1300c

richardjgowers reviewed Nov 21, 2022

hmacdope mentioned this pull request Nov 21, 2022

Return distance results as float rather than double for 3.0 or make return type flexible. #3927

Open

hmacdope and others added 11 commits January 7, 2023 22:41

Update package/MDAnalysis/lib/distances.py

46f471c

Co-authored-by: Irfan Alibay <IAlibay@users.noreply.github.com>

add align

9105b0b

WIP on stub

1f9e5d5

finalise distopia stub

e91d4e8

change calc_bonds to use distopia stub

402b90c

fix type annotation

f7b79fc

Merge remote-tracking branch 'upstream/develop' into Add_distopia

416afcc

darker

1ec7445

darker distances.py

9e0ea7d

try to make black happier again

9ff3ecf

fix distances.py again

e2d606f

hmacdope requested a review from IAlibay January 8, 2023 01:47

IAlibay requested changes Jan 8, 2023

hmacdope and others added 4 commits January 8, 2023 23:40

finally fix everything

028f548

fix versionchanged

ec04406

remove try except

8f98806

Update package/MDAnalysis/lib/distances.py

e185b7a

Co-authored-by: Irfan Alibay <IAlibay@users.noreply.github.com>

hmacdope commented Jan 8, 2023

package/MDAnalysis/lib/distances.py

hmacdope requested a review from IAlibay January 16, 2023 01:57

Merge remote-tracking branch 'upstream/develop' into Add_distopia

cba5b5f

hmacdope requested review from richardjgowers and orbeckst January 20, 2023 03:45

hmacdope and others added 3 commits January 27, 2023 08:58

Merge remote-tracking branch 'upstream/develop' into Add_distopia

ea8185b

finalise merge

403b7e6

Merge branch 'develop' into Add_distopia

04bb71b

IAlibay approved these changes Feb 13, 2023

hmacdope merged commit d27a32a into MDAnalysis:develop Feb 15, 2023

IAlibay added the enhancement label Sep 25, 2023
