[BUG] test_exact_percentile_groupby FAILED: hash_aggregate_test.py::test_exact_percentile_groupby with DATAGEN seed 1713362217 #10719
Comments
Failure not observed in today's nightly. Without any code update, 1023 FAILED and 1024 PASSED at the same revision (66f2cc5). Will keep monitoring!
Can you document what the DATAGEN seed was for the original failure and try to repro it? We want to keep this open for the original failure, with the DATAGEN seed recorded.
Updated title: DATAGEN seed = 1713362217
The diff comes from the CPU producing nulls where the GPU does not. Splitting out the differing columns on their own lines,
GPU:
|
Does this need a fixed seed, or do we need to fix the underlying problem?
I have raised NVIDIA/spark-rapids-jni#2029. To me, it looks like a bug in how percentiles are derived from the constructed histograms.
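For anyone following along, here is a minimal sketch of what "deriving a percentile from a histogram" can look like. It is not the spark-rapids-jni implementation. It assumes the linear-interpolation definition that, as far as I know, Spark's CPU `percentile` uses (position `(n - 1) * p` over the sorted non-null values), and the function name `percentile_from_histogram` and the dict-based histogram are purely illustrative. The point is the null handling: a group whose histogram is empty (for example, all inputs were null) should yield null rather than a number, which is the kind of CPU/GPU disagreement described above.

```python
import math
from bisect import bisect_right
from itertools import accumulate


def percentile_from_histogram(histogram, p):
    """Exact percentile from a {value: count} histogram (hypothetical helper).

    Assumes nulls were already excluded when the histogram was built.
    Returns None when the histogram holds no elements.
    """
    # Sort distinct values and drop zero-count buckets.
    items = sorted((v, c) for v, c in histogram.items() if c > 0)
    if not items:
        return None  # empty group: the result is null, not a number

    values = [v for v, _ in items]
    # cum[i] = number of elements with value <= values[i]
    cum = list(accumulate(c for _, c in items))
    total = cum[-1]

    # Fractional 0-based rank in the conceptual sorted list of all elements.
    pos = (total - 1) * p
    lo, hi = math.floor(pos), math.ceil(pos)

    def value_at(rank):
        # First bucket whose cumulative count exceeds the 0-based rank.
        return values[bisect_right(cum, rank)]

    lo_v, hi_v = value_at(lo), value_at(hi)
    if lo == hi:
        return float(lo_v)
    # Linear interpolation between the two neighbouring ranks.
    return (hi - pos) * lo_v + (pos - lo) * hi_v
```

With this sketch, `percentile_from_histogram({1: 1, 2: 1, 3: 1, 4: 1}, 0.5)` interpolates between ranks 1 and 2, and `percentile_from_histogram({}, 0.5)` returns `None`.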
The seed that triggers this issue should be |
Describe the bug
test_exact_percentile_groupby FAILED: hash_aggregate_test.py::test_exact_percentile_groupby on DB-11.3
Detailed failures as below