-
Notifications
You must be signed in to change notification settings - Fork 280
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error on querying NVIDIA devices | OverflowError: Python int too large to convert to C long #160
Comments
I encountered the same problem, and it's been solved by downgrading the nvidia-ml-py to a former version 11.525.112 using |
+1 same error
Thanks for the workaround @Lunar13737 , it worked for me. |
+1 and the workaround with downgrading
Any hints? |
I'd like to reproduce this issue to have a correct fix. But I've never seen the issue. What we know from #161 (comment):
@Lunar13737, @PyroGenesis, @mjmikulski thanks for the datapoints. Could you please try upgrading nvidia-ml-py==12.535.108 and see if the OverflowError is gone? |
@wookayin I can confirm, overflow error does not occur in nvidia-ml-py 12.535.108 |
@PyroGenesis Thanks. What was the previous version of nvidia-ml-py that resulted in this bug? |
@wookayin I think it was most likely 12.535.77 that caused the error, though I'm not 100% sure because I didn't keep a record of it. I downgraded to 11.525.112 which worked, and now 12.535.108 works too. |
@wookayin nvidia-ml-py 12.535.108 works for me, no overflow error |
Thanks. I can conclude that the root cause of this bug is essentially same as #161: one should use neither
|
nvidia-ml-py==12.535.77 is a buggy version that breaks the struct for process information, and should not be used (unless NVIDIA driver is *also* buggy, 535.43, 535.54, and 535.86). The latest version nvidia-ml-py==12.535.108 fixes the problem and is still compatible with our supported drivers (R450+). To ensure users who will install gpustat 1.2.0 have a correct version of nvidia-ml-py version installed, we bump up the requirement. See #160 and #161 for more details.
Describe the bug
Freshly installed
gpustat
. Upon runninggpustat
I get:gpustat --debug
:nvidia-smi
:Environment information:
1.2.dev7+g7c09a0f
It seems this bug has already been seen and solved over at
nvitop
XuehaiPan/nvitop#76The text was updated successfully, but these errors were encountered: