Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add updates for CUDA 12 libraries #617

Merged
merged 15 commits into from
Feb 23, 2024
Merged

Add updates for CUDA 12 libraries #617

merged 15 commits into from
Feb 23, 2024

Conversation

mlxd
Copy link
Member

@mlxd mlxd commented Feb 23, 2024

Before submitting

Please complete the following checklist when submitting a PR:

  • All new features must include a unit test.
    If you've fixed a bug or added code that should be tested, add a test to the
    tests directory!

  • All new functions and code must be clearly commented and documented.
    If you do make documentation changes, make sure that the docs build and
    render correctly by running make docs.

  • Ensure that the test suite passes, by running make test.

  • Add a new entry to the .github/CHANGELOG.md file, summarizing the
    change, and including a link back to the PR.

  • Ensure that code is properly formatted by running make format.

When all the above are checked, delete everything above the dashed
line and fill in the pull request template.


Context: This PR updates the reported error messages from CUDA specific libraries, and avoids unversioned libs from being included in wheel builds (such as when using the NVHPC toolkit).

Description of the Change: As above. Ensure better error messaging and reporting on complex systems (such as Perlmutter).

Benefits:

Possible Drawbacks:

Related GitHub Issues:

@mlxd mlxd added the ci:use-multi-gpu-runner Enable usage of Multi-GPU runner for this Pull Request label Feb 23, 2024
Copy link
Contributor

Hello. You may have forgotten to update the changelog!
Please edit .github/CHANGELOG.md with:

  • A one-to-two sentence description of the change. You may include a small working example for new features.
  • A link back to this PR.
  • Your name (or GitHub username) in the contributors section.

Copy link

codecov bot commented Feb 23, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 98.85%. Comparing base (86f22a7) to head (baeb12a).

Additional details and impacted files
@@            Coverage Diff             @@
##           master     #617      +/-   ##
==========================================
+ Coverage   98.70%   98.85%   +0.15%     
==========================================
  Files         169      203      +34     
  Lines       24019    29419    +5400     
==========================================
+ Hits        23707    29082    +5375     
- Misses        312      337      +25     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@mlxd mlxd changed the title Add additional unversioned libs for auditwheel excludes Add updates for CUDA 12 libraries Feb 23, 2024
@mlxd mlxd marked this pull request as ready for review February 23, 2024 20:34
@mlxd mlxd requested a review from a team February 23, 2024 20:34
Copy link
Contributor

@jay-selby jay-selby left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Copy link
Contributor

@vincentmr vincentmr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me. I'd just suggest uniformizing the case for cuBLAS, cuSPARSE and cuStateVec error messages. Either all lowercase, or trademark case maybe?

mlxd and others added 8 commits February 23, 2024 17:14
…Error.hpp

Co-authored-by: Vincent Michaud-Rioux <vincentm@nanoacademic.com>
…Error.hpp

Co-authored-by: Vincent Michaud-Rioux <vincentm@nanoacademic.com>
…Error.hpp

Co-authored-by: Vincent Michaud-Rioux <vincentm@nanoacademic.com>
…Error.hpp

Co-authored-by: Vincent Michaud-Rioux <vincentm@nanoacademic.com>
…Error.hpp

Co-authored-by: Vincent Michaud-Rioux <vincentm@nanoacademic.com>
…Error.hpp

Co-authored-by: Vincent Michaud-Rioux <vincentm@nanoacademic.com>
@mlxd mlxd merged commit 8794191 into master Feb 23, 2024
86 of 87 checks passed
@mlxd mlxd deleted the cuda_fixes branch February 23, 2024 22:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ci:use-multi-gpu-runner Enable usage of Multi-GPU runner for this Pull Request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants