Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update test case related to LEACY datetime format to unblock nightly CI #11545

Merged
merged 1 commit into from
Sep 30, 2024

Conversation

res-life
Copy link
Collaborator

closes #11543

Changes:

Update test case: only test years after 1900

Description

This issue is caused by year 1582.
Spark uses hybrid Julian+Gregorian calendar for LEGACY mode.

This issue is related to LEGACY mode, Spark itself has different behaviors between LEGACY mode and non-LEGACY mode, and Rapids kernel has not 100% matched non-LEGACY yet.
Rapids keeps consistent with Spark when mode is CORRECTED mode.

We already documented that LEGACY mode has several limitations

So only update test case.

Signed-off-by: Chong Gao res_life@163.com

Signed-off-by: Chong Gao <res_life@163.com>
@res-life
Copy link
Collaborator Author

For branch 24.12 branch, refer to #11544

@res-life
Copy link
Collaborator Author

build

@pxLi
Copy link
Collaborator

pxLi commented Sep 30, 2024

For branch 24.12 branch, refer to #11544

24.12 has diff change? 24.10 commits will be auto-merge into 24.12 by default

does #11539 would affect 24.12 only?

#11493 I see

def test_yyyyMMdd_format_for_legacy_mode():
gen = StringGen("[0-9]{3}[1-9](0[1-9]|1[0-2])(0[1-9]|[1-2][0-9])")
gen = StringGen('(19[0-9]{2}|[2-9][0-9]{3})([0-9]{4})')
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Q: (19[0-9]{2}|[2-9][0-9]{3})([0-9]{4}) seems not only modifying the year format, but also changes the mmdd?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, do not need to change mmdd.

Copy link
Collaborator

@pxLi pxLi Sep 30, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

you didnt replying my question... You changed the mmdd in this change, was this intentional?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sorry for the previous reply.

You changed the mmdd in this change, was this intentional?

Yes, changed MM, and it's intentional.

The yyyy years after 1900 are OK.
For MM, invalid values like 99 month will result in NULL.
For dd, also any values [0-9]{2} is OK.

This PR is changing branch 24.10, and PR is changing 24.12.
On branch 24.12, it tests yyyymmdd beside of yyyyMMdd, and test case name renamed from test_yyyyMMdd_format_for_legacy_mode to test_formats_for_legacy_mode
mm: Minute.
MM: month.
So we will use the changes on 24.12.
And we expect the auto-merge from 24.10 to 24.12 will fall.

@res-life
Copy link
Collaborator Author

24.10 commits will be auto-merge into 24.12 by default

Will conflict, refer to 24.12 changes: #11544

@res-life res-life merged commit c74e2dd into NVIDIA:branch-24.10 Sep 30, 2024
45 of 46 checks passed
@res-life res-life deleted the fix-yyyyMMdd-case branch September 30, 2024 09:15
@sameerz sameerz added the test Only impacts tests label Oct 1, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
test Only impacts tests
Projects
None yet
3 participants