[FEA] Enable regular expressions by default #4509

Open
55 of 61 tasks
andygrove opened this issue Jan 12, 2022 · 3 comments
Labels
epic (Issue that encompasses a significant feature or body of work), feature request (New feature or request)

Comments

@andygrove
Contributor

andygrove commented Jan 12, 2022

Is your feature request related to a problem? Please describe.
Regular expression support is currently disabled by default because of many known compatibility issues, which are documented in the compatibility guide. This epic tracks the work required to address those issues and enable the feature by default.

Completed

High Priority

Medium Priority

Low Priority

Describe the solution you'd like
Support the following regular expression functions and expressions by default, with 100% compatibility with Spark:

  • regexp / regexp_like / RLIKE
  • regexp_replace
  • regexp_extract
  • regexp_extract_all
  • split
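For reference, here is a minimal sketch of the CPU behavior these functions would need to match. It assumes Spark's CPU implementations delegate to `java.util.regex` and that `split` uses a default limit of -1. The class and helper names (`RegexReference`, `rlike`, `regexpExtract`, and so on) are hypothetical and only approximate Spark's semantics; null handling and error cases are omitted.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical reference helpers, not part of Spark or the plugin.
class RegexReference {
    // regexp / regexp_like / RLIKE: true if the pattern matches anywhere in the input.
    static boolean rlike(String s, String regex) {
        return Pattern.compile(regex).matcher(s).find();
    }

    // regexp_replace: replace every match; "$1" style group references are honored.
    static String regexpReplace(String s, String regex, String replacement) {
        return Pattern.compile(regex).matcher(s).replaceAll(replacement);
    }

    // regexp_extract: the requested group of the first match, or "" if there is none.
    static String regexpExtract(String s, String regex, int group) {
        Matcher m = Pattern.compile(regex).matcher(s);
        if (!m.find()) {
            return "";
        }
        String g = m.group(group);
        return g == null ? "" : g;
    }

    // regexp_extract_all: the requested group of every match, in order.
    static List<String> regexpExtractAll(String s, String regex, int group) {
        List<String> out = new ArrayList<>();
        Matcher m = Pattern.compile(regex).matcher(s);
        while (m.find()) {
            String g = m.group(group);
            out.add(g == null ? "" : g);
        }
        return out;
    }

    // split: a limit of -1 keeps trailing empty strings.
    static String[] split(String s, String regex) {
        return s.split(regex, -1);
    }

    public static void main(String[] args) {
        System.out.println(rlike("abc123", "[0-9]+"));
        System.out.println(regexpReplace("a1b22", "[0-9]+", "#"));
        System.out.println(regexpExtract("100-200", "(\\d+)-(\\d+)", 2));
        System.out.println(regexpExtractAll("100-200, 300-400", "(\\d+)-(\\d+)", 1));
        System.out.println(String.join("|", split("a,b,,", ",")));
    }
}
```

A GPU implementation could be checked against helpers like these over a corpus of patterns and inputs, with any difference in output counted as a compatibility issue.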

Describe alternatives you've considered
None

Additional context
None

@andygrove added the feature request (New feature or request), ? - Needs Triage (Need team to review and classify), and epic (Issue that encompasses a significant feature or body of work) labels on Jan 12, 2022
@jlowe
Member

jlowe commented Jan 12, 2022

@andygrove FYI I added #4511 to the list, since I think we need to improve the current situation where regex kernels can fail with a confusing OOM error due to insufficient reserved memory rather than insufficient pool memory.

@sameerz removed the ? - Needs Triage (Need team to review and classify) label on Jan 18, 2022
@sperlingxx
Collaborator

sperlingxx commented Mar 30, 2022

Hi @andygrove, I found another bug in regexp_extract: #5088. Shall we add it to the list?

@sperlingxx
Collaborator

Hi @andygrove, I added #5135 to the list as a high-priority task, since I think it is a correctness issue that is triggered by more than just corner cases.

4 participants