Add build and test support for CUDA 12 #606

Merged
mlxd merged 35 commits into master from update/lgpu_cuda12 on Feb 12, 2024

Conversation

mlxd
Member

@mlxd mlxd commented Feb 2, 2024

Before submitting

Please complete the following checklist when submitting a PR:

  • All new features must include a unit test.
    If you've fixed a bug or added code that should be tested, add a test to the
    tests directory!

  • All new functions and code must be clearly commented and documented.
    If you do make documentation changes, make sure that the docs build and
    render correctly by running make docs.

  • Ensure that the test suite passes, by running make test.

  • Add a new entry to the .github/CHANGELOG.md file, summarizing the
    change, and including a link back to the PR.

  • Ensure that code is properly formatted by running make format.

When all the above are checked, delete everything above the dashed
line and fill in the pull request template.


Context: This PR adds build and test support for CUDA 12 with lightning.gpu and sunsets CUDA 11 support in the shipped wheels.

Description of the Change: Updates the build tooling to use CUDA 12

Benefits: Natively supports CUDA 12

Possible Drawbacks: Additional build overhead if CUDA 11 is maintained.

Related GitHub Issues:

@mlxd
Member Author

mlxd commented Feb 2, 2024

[sc-37516]


codecov bot commented Feb 2, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (f3beabc) 98.48% compared to head (a2aaac3) 97.06%.

Additional details and impacted files
@@            Coverage Diff             @@
##           master     #606      +/-   ##
==========================================
- Coverage   98.48%   97.06%   -1.43%     
==========================================
  Files         169      145      -24     
  Lines       24760    19276    -5484     
==========================================
- Hits        24386    18711    -5675     
- Misses        374      565     +191     


@codecov-commenter

codecov-commenter commented Feb 6, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (f3beabc) 98.48% compared to head (c0c5a25) 98.68%.

Additional details and impacted files
@@            Coverage Diff             @@
##           master     #606      +/-   ##
==========================================
+ Coverage   98.48%   98.68%   +0.19%     
==========================================
  Files         169      169              
  Lines       24760    24299     -461     
==========================================
- Hits        24386    23980     -406     
+ Misses        374      319      -55     
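The diff column in these reports does not equal the difference of the displayed percentages (98.68 - 98.48 is 0.20, yet the report shows +0.19%). One reading that fits both reports, offered as an inference from these numbers and not as documented Codecov behaviour, is that coverage is hits divided by lines, the difference is taken on the unrounded values, and every figure is floored to two decimals. The helper names `coverage` and `floor2` below are hypothetical.

```python
import math


def coverage(hits, lines):
    """Coverage as a percentage: covered lines over total coverable lines."""
    return 100.0 * hits / lines


def floor2(value):
    """Floor a value to two decimal places (assumed display rule, not confirmed)."""
    return math.floor(value * 100) / 100


# Hits and Lines copied from the two coverage tables in this thread.
base = coverage(24386, 24760)       # master (f3beabc)
head_feb2 = coverage(18711, 19276)  # head a2aaac3, Feb 2 report
head_feb6 = coverage(23980, 24299)  # head c0c5a25, Feb 6 report

# The difference is computed before flooring, which is why it can differ
# from the difference of the two displayed percentages.
for label, head in (("Feb 2", head_feb2), ("Feb 6", head_feb6)):
    print(label, floor2(base), floor2(head), floor2(head - base))
```

Under that assumption the sketch reproduces the displayed base, head, and diff values in both reports. The Feb 2 drop coincides with 24 fewer files being counted, so it may reflect missing coverage uploads and not lost tests; the thread does not say.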


mlxd and others added 10 commits February 6, 2024 17:26
* Cache vcpkg libs and reuse

* Auto update version

* Trigger CI

* Fix matrix tests for windows

* Add excluded modules for OpenCPPCoverage

* Convert dash to underscore

* Use optimized build for Windows coverage

* Retrigger CI

---------

Co-authored-by: Dev version update bot <github-actions[bot]@users.noreply.github.com>
@mlxd mlxd marked this pull request as ready for review February 9, 2024 17:00
@mlxd mlxd requested review from vincentmr and AmintorDusko and removed request for vincentmr February 9, 2024 17:00
Contributor

@AmintorDusko AmintorDusko left a comment

Short and sweet! Thank you for that!

@mlxd mlxd added the ci:use-multi-gpu-runner Enable usage of Multi-GPU runner for this Pull Request label Feb 9, 2024
Contributor

@vincentmr vincentmr left a comment

Could you grep -r cu11 .github/workflows and fix the compat workflows and the few other strings still retaining cu11? Otherwise looks good, just a question.
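For anyone repeating this check without grep (for example on a Windows runner), a rough Python equivalent of the recursive search is sketched below. The function name `find_token` is hypothetical, and the sketch only lists matches; which strings should actually change to cu12 still needs a manual look.

```python
from pathlib import Path


def find_token(root, token="cu11"):
    """Return (path, line_number, line) for every line under root containing token."""
    hits = []
    for path in sorted(Path(root).rglob("*")):
        if not path.is_file():
            continue
        try:
            text = path.read_text(encoding="utf-8")
        except (UnicodeDecodeError, OSError):
            # Skip binary or unreadable files instead of aborting the scan.
            continue
        for number, line in enumerate(text.splitlines(), start=1):
            if token in line:
                hits.append((str(path), number, line.strip()))
    return hits


if __name__ == "__main__":
    # Same search root as the grep command in the review comment.
    for path, number, line in find_token(".github/workflows"):
        print(f"{path}:{number}: {line}")
```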

@mlxd mlxd requested a review from vincentmr February 9, 2024 21:44
Contributor

@vincentmr vincentmr left a comment

Looks good to me now, happy to approve and feel free to merge once the CI goes green. Cheers!

@mlxd
Member Author

mlxd commented Feb 9, 2024

OpenMPI 5.0 is causing failures with the distributed workload. We will need to rebuild the runner image using an older version and rerun before merging.

@mlxd mlxd merged commit 3527765 into master Feb 12, 2024
83 of 85 checks passed
@mlxd mlxd deleted the update/lgpu_cuda12 branch February 12, 2024 18:54