Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add string.split APIs to pylibcudf #16940

Merged

Conversation

mroeschke
Copy link
Contributor

Description

Contributes to #15162

Includes split/split.pxd and split/partition.pxd

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

@mroeschke mroeschke added improvement Improvement / enhancement to an existing function non-breaking Non-breaking change pylibcudf Issues specific to the pylibcudf package labels Sep 26, 2024
@github-actions github-actions bot added Python Affects Python cuDF API. CMake CMake build issue labels Sep 26, 2024
@mroeschke mroeschke marked this pull request as ready for review September 27, 2024 23:14
@mroeschke mroeschke requested a review from a team as a code owner September 27, 2024 23:14
from cython.operator import dereference


cpdef Table partition(Column input, Scalar delimiter):
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we add the empty string default value here? We've done it like this so far.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah OK. I think I may have missed this default value pattern to match the libcudf default value in my recently pylibcudf PRs. I'll got back in a follow up to try to address this.

return pa_array, plc_column


@pytest.fixture
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We might consider parameterizing over None here as the delimiter as well once the APIs handle the default value

Copy link
Contributor

@brandon-b-miller brandon-b-miller left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

One change otherwise LGTM

@mroeschke
Copy link
Contributor Author

/merge

@rapids-bot rapids-bot bot merged commit bac81cb into rapidsai:branch-24.12 Oct 2, 2024
98 checks passed
@mroeschke mroeschke deleted the pylibcucudf/strings/split branch October 2, 2024 02:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CMake CMake build issue improvement Improvement / enhancement to an existing function non-breaking Non-breaking change pylibcudf Issues specific to the pylibcudf package Python Affects Python cuDF API.
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

2 participants