Improve JSON and CSV support for boolean values #4780

Merged: 3 commits, Feb 15, 2022

andygrove
Contributor

@andygrove andygrove commented Feb 14, 2022

Signed-off-by: Andy Grove <andygrove@nvidia.com>

Closes #2071 and #4763

This PR improves support for reading boolean values from JSON and CSV: the values are read as strings in cuDF, and a CAST operation then parses each string as a boolean in a Spark-compatible way.
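
The two-step approach (load the raw column as strings, then cast) can be sketched roughly as follows. This is a plain-Python illustration, not the plugin's cuDF code. The function name `cast_strings_to_boolean`, the accepted literal sets, and the trimming and case-folding are all assumptions made for the example; the PR does not spell out the exact literals that count as Spark-compatible.

```python
from typing import Iterable, List, Optional

# Assumed literal sets, for illustration only. The real Spark-compatible
# sets are not stated in this PR.
TRUE_LITERALS = frozenset({"true"})
FALSE_LITERALS = frozenset({"false"})


def cast_strings_to_boolean(
    values: Iterable[Optional[str]],
    true_literals: frozenset = TRUE_LITERALS,
    false_literals: frozenset = FALSE_LITERALS,
) -> List[Optional[bool]]:
    """Hypothetical cast step: map raw strings to True, False, or None."""
    result: List[Optional[bool]] = []
    for raw in values:
        if raw is None:
            # Missing input stays missing.
            result.append(None)
            continue
        # Assumption: surrounding whitespace and case are ignored.
        token = raw.strip().lower()
        if token in true_literals:
            result.append(True)
        elif token in false_literals:
            result.append(False)
        else:
            # Anything unrecognized becomes null rather than raising.
            result.append(None)
    return result


# Read step of the sketch: the column arrives as raw strings.
raw_column = ["true", "False", "maybe", None, " TRUE "]
parsed = cast_strings_to_boolean(raw_column)
```

Keeping the accepted literals as parameters means the read step stays generic, and only the cast step has to encode whatever rules the target engine applies.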

This does not yet work correctly for JSON input that contains quoted strings (such as "true" rather than true); follow-on issue #4779 tracks the fix.
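
One plausible reading of this limitation is that a JSON parser can tell a quoted string from a boolean literal, but the distinction disappears once every value has been flattened to a string. The sketch below uses Python's standard `json` module only to show that loss of information. The helper `as_raw_string` is a hypothetical stand-in for reading every value as a string, not the cuDF reader.

```python
import json

# A JSON parser keeps the two token kinds apart.
bare = json.loads('{"flag": true}')["flag"]        # boolean literal
quoted = json.loads('{"flag": "true"}')["flag"]    # quoted string


def as_raw_string(value) -> str:
    """Hypothetical stand-in for 'read every value as a string'."""
    if isinstance(value, bool):
        return "true" if value else "false"
    return str(value)


bare_type = type(bare).__name__
quoted_type = type(quoted).__name__

# After flattening to strings, both inputs look identical, so a later
# string-to-boolean cast cannot treat them differently.
same_after_string_read = as_raw_string(bare) == as_raw_string(quoted)
```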

Signed-off-by: Andy Grove <andygrove@nvidia.com>
revans2
revans2 previously approved these changes Feb 14, 2022
@andygrove andygrove changed the title WIP: Improve JSON and CSV support for boolean values Improve JSON and CSV support for boolean values Feb 14, 2022
@andygrove andygrove marked this pull request as ready for review February 14, 2022 19:23
@andygrove
Contributor Author

build

@andygrove
Contributor Author

build

@andygrove andygrove marked this pull request as draft February 14, 2022 21:21
@andygrove andygrove changed the title Improve JSON and CSV support for boolean values WIP: Improve JSON and CSV support for boolean values Feb 14, 2022
@andygrove
Contributor Author

I found some edge cases that are not handled correctly so I am moving this back to draft / WIP for now

@andygrove andygrove changed the title WIP: Improve JSON and CSV support for boolean values Improve JSON and CSV support for boolean values Feb 14, 2022
@andygrove andygrove marked this pull request as ready for review February 14, 2022 23:15
@andygrove
Contributor Author

build

@andygrove andygrove linked an issue Feb 15, 2022 that may be closed by this pull request
@andygrove andygrove merged commit 5132869 into NVIDIA:branch-22.04 Feb 15, 2022
@andygrove andygrove deleted the json-boolean branch February 15, 2022 20:13
@sameerz sameerz added the bug Something isn't working label Feb 18, 2022
Labels
bug Something isn't working
3 participants