[BUG] Null characters do not work in regular expression character classes #5909

Closed
3 tasks done
anthony-chang opened this issue Jun 24, 2022 · 0 comments · Fixed by #5976
Labels: bug (Something isn't working)

anthony-chang commented Jun 24, 2022

Describe the bug
We currently fall back to the CPU whenever a null character (\u0000) appears inside a regular expression character class.

Steps/Code to reproduce bug
Run any regular expression with \u0000 inside a character class.
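
As a rough illustration of the kind of pattern involved, the sketch below puts a null character inside a character class and shows the matches a reference engine produces. It uses Python's re module purely as a stand-in for the CPU regex engine; the names pattern, sample, and matches are illustrative and do not come from the project.

```python
import re

# A NUL character (U+0000) placed inside a character class, next to 'a'.
# Python's re module is only a stand-in here for the reference (CPU) engine.
pattern = re.compile("[\u0000a]")

# Sample input containing one NUL and one 'a' (illustrative data).
sample = "x\u0000ya"

# The class should match both the NUL character and 'a'.
matches = pattern.findall(sample)
print([hex(ord(m)) for m in matches])
```

A GPU implementation that supports this case would be expected to return the same matches rather than falling back.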

Expected behavior

  • Do not throw an unsupported exception for null characters in character classes; instead, transpile them to their octal/hex representation
  • Add \u0000 to the predefined classes \p{ASCII} and \p{Cntrl}
  • Update compatibility docs
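
The first two items above can be sketched as follows. This is a minimal sketch under stated assumptions, not the project's transpiler: the function escape_nulls_in_classes and the table PREDEFINED_CLASSES are hypothetical names, the hex form \x00 is chosen arbitrarily over the octal form, and the class expansions follow the ranges that the java.util.regex.Pattern documentation gives for \p{ASCII} and \p{Cntrl}. Python's re module is used only to check that the rewritten patterns still match a null character.

```python
import re

# Hypothetical expansions of the predefined classes, both including \x00.
# Ranges follow java.util.regex.Pattern: \p{ASCII} is [\x00-\x7F] and
# \p{Cntrl} is [\x00-\x1F\x7F].
PREDEFINED_CLASSES = {
    r"\p{ASCII}": r"[\x00-\x7F]",
    r"\p{Cntrl}": r"[\x00-\x1F\x7F]",
}


def escape_nulls_in_classes(pattern: str) -> str:
    """Rewrite raw NUL characters inside [...] as the textual escape \\x00.

    Simplified on purpose: nested classes and a literal ']' placed first
    in a class are not handled.
    """
    out = []
    in_class = False
    i = 0
    while i < len(pattern):
        ch = pattern[i]
        if ch == "\\" and i + 1 < len(pattern):
            # Copy escape sequences verbatim so "\[" and "\]" do not
            # change the in-class state.
            out.append(pattern[i:i + 2])
            i += 2
            continue
        if ch == "[" and not in_class:
            in_class = True
        elif ch == "]" and in_class:
            in_class = False
        if ch == "\x00" and in_class:
            out.append("\\x00")  # hex form; "\\000" would be the octal form
        else:
            out.append(ch)
        i += 1
    return "".join(out)


rewritten = escape_nulls_in_classes("[\x00a]+")
print(rewritten)
```

Null characters outside a character class are left untouched here, since the issue only concerns character classes.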

Environment details (please complete the following information)
N/A

Additional context
Depends on rapidsai/cudf#11112
Related to #5834

anthony-chang added the "bug" and "? - Needs Triage" labels on Jun 24, 2022
anthony-chang self-assigned this on Jun 24, 2022
sameerz removed the "? - Needs Triage" label on Jun 28, 2022