Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Remove the mixedType config for JSON as it has no downsides any longer #10716

Merged
merged 2 commits into from
Apr 24, 2024

Conversation

revans2
Copy link
Collaborator

@revans2 revans2 commented Apr 16, 2024

mixedTypesAsStrings was added in when we relied on CUDF to do schema discovery when parsing JSON. That is no longer the case and there are no real downsides to it any longer. So remove the config because it is not really needed.

This patch sets it to always be on, as that is the direction we want to go in.

Signed-off-by: Robert (Bobby) Evans <bobby@apache.org>
@revans2
Copy link
Collaborator Author

revans2 commented Apr 16, 2024

build

Comment on lines 371 to 372
Mixed types can have some problem. If an item being read could have some lines that are arrays
and others that are structs/dictionaries it is possible for an error will be thrown.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
Mixed types can have some problem. If an item being read could have some lines that are arrays
and others that are structs/dictionaries it is possible for an error will be thrown.
Mixed types can have some problems. If an item being read could have some lines that are arrays
and others that are structs/dictionaries it is possible an error will be thrown.

@@ -268,6 +268,7 @@ object GpuJsonReadCommon {

private def throwMismatchException(cv: ColumnView,
dt: DataType): (Option[ColumnView], Seq[AutoCloseable]) = {
ai.rapids.cudf.TableDebug.get.debug(s"JSON MISMATCH $dt", cv)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Intentional or debug code left in?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

oops

@revans2
Copy link
Collaborator Author

revans2 commented Apr 16, 2024

build

@sameerz sameerz added the task Work required that improves the product but is not user facing label Apr 17, 2024
@revans2 revans2 merged commit d321e6b into NVIDIA:branch-24.06 Apr 24, 2024
43 checks passed
@revans2 revans2 deleted the default_mixed_type branch April 24, 2024 20:25
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
task Work required that improves the product but is not user facing
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants