Template/expval #489

vincentmr · 2023-08-28T18:46:04Z

Before submitting

Please complete the following checklist when submitting a PR:

All new features must include a unit test.
If you've fixed a bug or added code that should be tested, add a test to the
tests directory!
All new functions and code must be clearly commented and documented.
If you do make documentation changes, make sure that the docs build and
render correctly by running make docs.
Ensure that the test suite passes, by running make test.
Add a new entry to the .github/CHANGELOG.md file, summarizing the
change, and including a link back to the PR.
Ensure that code is properly formatted by running make format.

When all the above are checked, delete everything above the dashed
line and fill in the pull request template.

Context:
This PR is a follow-up on #481. In the last PR, it appeared that reducing expval on the fly is generally faster than using inner products. Another factor is the computation of the observable-statevector product and the parallelization scheme used to do it. The general scheme uses three layers of parallelism with team policies. This introduces several parameters which should be tuned for optimal performance, but are currently left to Kokkos' heuristics to decide. On the other hand, the straightforward range policy-based scheme of the 1- and 2-qubit kernels outperforms the general scheme significantly.

Since this discrepancy does not appear explainable by the flop intensity increase between 2- and 3+-qubit kernels, I introduce specialized 3- to 5-qubit kernels. I draw the following conclusions:

On-the-fly expval kernels are generally faster.
Range-policy kernels are faster than the team-policy one up to 4-qubits on the OPENMP and HIP backends and up to 5-qubits on CUDA.

The following figures show timings to get the expectation value of a Hermitian observable for OPENMP, CUDA and HIP respectively.

Description of the Change:
Introduce specialized 3- to 5-qubit kernels. Refactor getExpValMatrix wrapper in MeasurementsKokkos.hpp. Add few tests.

Benefits:
Faster expval on all platforms, especially for 3+-qubit observables.

Possible Drawbacks:
None

Related GitHub Issues:
#481

…ata` to work with devices. M pennylane_lightning/core/src/simulators/lightning_kokkos/StateVectorKokkos.hpp; `applyMatrix` bugfix: use intermediate hostview to copy matrix data; same bugfix for `getDataVector`. M pennylane_lightning/core/src/simulators/lightning_kokkos/algorithms/AdjointJacobianKokkos.hpp; use copy constructor. M pennylane_lightning/core/src/simulators/lightning_kokkos/measurements/MeasurementsKokkos.hpp; use copy constructor. M pennylane_lightning/core/src/simulators/lightning_kokkos/observables/ObservablesKokkos.hpp; use copy constructor. M requirements-dev.txt; add clang-format-14.

… vector data in adjoint-diff.

…calls into two templated methods. Call specialized expval methods when possible. Remove obsolete 'Apply directly' tests.

…alueMultiQubitOpFunctor.

codecov · 2023-08-30T20:54:58Z

Codecov Report

Patch coverage: 100.00% and project coverage change: +6.04% 🎉

Comparison is base (869bbb8) 93.04% compared to head (5123082) 99.09%.
Report is 1 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #489      +/-   ##
==========================================
+ Coverage   93.04%   99.09%   +6.04%     
==========================================
  Files         142      142              
  Lines       16278    16693     +415     
==========================================
+ Hits        15146    16542    +1396     
+ Misses       1132      151     -981

Files Changed	Coverage Δ
...tning_qubit/gates/tests/Test_OpToMemberFuncPtr.cpp	`18.46% <ø> (ø)`
pennylane_lightning/core/_version.py	`100.00% <100.00%> (ø)`
.../simulators/lightning_kokkos/StateVectorKokkos.hpp	`99.76% <100.00%> (+5.99%)`	⬆️
...s/gates/tests/Test_StateVectorKokkos_Generator.cpp	`100.00% <100.00%> (ø)`
...os/gates/tests/Test_StateVectorKokkos_NonParam.cpp	`100.00% <100.00%> (ø)`
...okkos/gates/tests/Test_StateVectorKokkos_Param.cpp	`100.00% <100.00%> (ø)`
...s/lightning_kokkos/measurements/ExpValFunctors.hpp	`100.00% <100.00%> (+43.06%)`	⬆️
...ghtning_kokkos/measurements/MeasurementsKokkos.hpp	`98.26% <100.00%> (+3.79%)`	⬆️
...asurements/tests/Test_StateVectorKokkos_Expval.cpp	`100.00% <100.00%> (ø)`
...surements/tests/Test_StateVectorKokkos_Measure.cpp	`100.00% <100.00%> (ø)`
... and 2 more

... and 9 files with indirect coverage changes

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

AmintorDusko

I left only a few comments for now.
I see that we still have some work to do in terms of coverage.

.github/CHANGELOG.md

pennylane_lightning/core/src/simulators/lightning_kokkos/gates/README.md

pennylane_lightning/core/src/simulators/lightning_kokkos/measurements/ExpValFunctors.hpp

vincentmr · 2023-08-31T19:35:45Z

I left only a few comments for now. I see that we still have some work to do in terms of coverage.

I would like to merge #485 first to assess the coverage situation.

AmintorDusko · 2023-08-31T19:42:10Z

I left only a few comments for now. I see that we still have some work to do in terms of coverage.

I would like to merge #485 first to assess the coverage situation.

Absolutely, I think it is only sensible to do so.

AmintorDusko

Nothing more to add. Thank you for that!

mlxd

Nothing more to add --- thanks a bunch @vincentmr
I'm happy with the macro-approach for now, but we can revisit later to see if it can become some compile-time generated parameter-packed solution.

...ng/core/src/simulators/lightning_kokkos/measurements/tests/Test_StateVectorKokkos_Expval.cpp

vincentmr and others added 30 commits August 21, 2023 10:52

Auto update version

27b54eb

Update changelog.

8ad1a26

Merge branch 'master' into bugfix/cuda12

8375e7d

Merge branch 'master' into bugfix/cuda12

1098402

Auto update version

68881d1

Merge branch 'master' into bugfix/cuda12

e3df23b

Auto update version

fcc7fa3

Add an argument to adjointJacobian to avoid syncing and copying state…

48d9615

… vector data in adjoint-diff.

Reformat

3248276

trigger CI

504c228

[skip ci] Update changelog.

27f8e81

Introduce std::unordered_map<std::string, ExpValFunc> expval_funcs_.

c45cd23

Introduce applyExpectationValueFunctor.

33ff620

Add binding to LKokkos expval(matrix, wires). Combine expval functor …

e0d3212

…calls into two templated methods. Call specialized expval methods when possible. Remove obsolete 'Apply directly' tests.

Update changelog.

4305edc

Add test for arbitrary expval(Hermitian).

5595e3c

Add getExpectationValueMultiQubitOpFunctor.

22c47f4

Add typename hint for macos.

1e1565d

Add typename macos.

614e4de

Use Kokkos::ThreadVectorRange policy for innerloop in getExpectationV…

b1afba8

…alueMultiQubitOpFunctor.

Merge branch 'master' into bugfix/cuda12

9142b16

Auto update version

3b3ee66

Merge branch 'bugfix/cuda12' into accel/expval

7b22095

Merge branch 'master' into bugfix/cuda12

2c7cefc

Auto update version

6dc7883

Couple fix for HIP.

53b48d2

Merge branch 'bugfix/cuda12' into accel/expval

cb43f40

WIP

d31f1fa

Add specialized 3-5 qubit expval functors.

51b7497

vincentmr and others added 6 commits August 29, 2023 07:28

Bump pennylane version.

27d9c9d

Merge branch 'master' into template/expval

de984ce

Auto update version

edc94ea

Reimplement expval functors with macros.

255b971

Auto update version

aac80f3

Merge branch 'master' into template/expval

d56ec3f

vincentmr mentioned this pull request Aug 31, 2023

Add test coverage for LKokkos #485

Merged

5 tasks

vincentmr marked this pull request as ready for review August 31, 2023 17:06

vincentmr requested review from AmintorDusko and multiphaseCFD August 31, 2023 17:07

AmintorDusko reviewed Aug 31, 2023

View reviewed changes

.github/CHANGELOG.md Outdated Show resolved Hide resolved

pennylane_lightning/core/src/simulators/lightning_kokkos/gates/README.md Outdated Show resolved Hide resolved

pennylane_lightning/core/src/simulators/lightning_kokkos/measurements/ExpValFunctors.hpp Show resolved Hide resolved

Update CHANGELOG.md

230e94a

vincentmr and others added 9 commits September 6, 2023 08:09

Merge remote-tracking branch 'origin/master' into template/expval

d8cd73a

Auto update version

3e4e6cd

trigger CI

4855de5

trigger CI

313b5df

Bump Kokkos to 4.1.00 in CI.

bc7642b

Revert kokkos ver.

c977466

Add tests for macroed expval functors.

3c741fb

Remove redundant black lines.

aef8716

Use matrix interface to get expval of HermitianObs in LKokkos.

a1f0d4f

vincentmr requested a review from AmintorDusko September 6, 2023 19:32

Cover kokkos_args error.

5123082

AmintorDusko approved these changes Sep 7, 2023

View reviewed changes

mlxd approved these changes Sep 7, 2023

View reviewed changes

...ng/core/src/simulators/lightning_kokkos/measurements/tests/Test_StateVectorKokkos_Expval.cpp Show resolved Hide resolved

vincentmr merged commit e96a53f into master Sep 7, 2023
61 checks passed

vincentmr deleted the template/expval branch September 7, 2023 14:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Template/expval #489

Template/expval #489

vincentmr commented Aug 28, 2023 •

edited

Loading

codecov bot commented Aug 30, 2023 •

edited

Loading

AmintorDusko left a comment

vincentmr commented Aug 31, 2023

AmintorDusko commented Aug 31, 2023

AmintorDusko left a comment

mlxd left a comment

Template/expval #489

Template/expval #489

Conversation

vincentmr commented Aug 28, 2023 • edited Loading

Before submitting

codecov bot commented Aug 30, 2023 • edited Loading

Codecov Report

AmintorDusko left a comment

Choose a reason for hiding this comment

vincentmr commented Aug 31, 2023

AmintorDusko commented Aug 31, 2023

AmintorDusko left a comment

Choose a reason for hiding this comment

mlxd left a comment

Choose a reason for hiding this comment

vincentmr commented Aug 28, 2023 •

edited

Loading

codecov bot commented Aug 30, 2023 •

edited

Loading