[BUG] GetJsonObject throws exception when json path contains a name starting with `'` #10537

thirtiseven · 2024-03-04T06:09:38Z

Describe the bug
$.'a is a valid path from Spark's check, so it is normalized to $[''a'] and passed to kernel in plugin, resulting in

ai.rapids.cudf.CudfException: CUDF failure on: /home/jenkins/agent/workspace/jenkins-spark-rapids-jni_nightly-dev-690-cuda11/thirdparty/cudf/cpp/src/json/json_path.cu:631: Invalid empty name in JSONPath query string

Note that $[''a'] is actually an invalid path from Spark's validation. So maybe the normalization we did in plugin b9c292c is losing some information when converting the List[PathInstruction] back to a string and causing some problems.

And at the same time, cuDF kernel can't match this case as well if we remove the normalization.

data: {"'a":"1"}

Spark, spark-rapids, cudf and jsonpath.com will give very different results in this case.

JSONPath	Spark	spark-rapids	cudf kernel(spark-rapids without normalization)	jsonpath.com
$[''a']	null	null	Invalid empty name in JSONPath query string	["1"]
$.'a	1	Invalid empty name in JSONPath query string	Encountered invalid JSONPath input string	No match

I think we can fallback on this case until we have a path parser that matches Spark's behavior.

The text was updated successfully, but these errors were encountered:

thirtiseven · 2024-03-04T07:17:49Z

rapidsai/cudf#15082 looks to be contributing to fixing this issue, will update after it is included in the plugin.

res-life · 2024-03-19T01:36:44Z

Will be fixed by: NVIDIA/spark-rapids-jni#1868

thirtiseven added bug Something isn't working ? - Needs Triage Need team to review and classify labels Mar 4, 2024

thirtiseven mentioned this issue Mar 4, 2024

[BUG] GetJsonObject should return null for invalid query instead of throwing an exception #10212

Closed

GaryShen2008 assigned thirtiseven Mar 5, 2024

sameerz removed the ? - Needs Triage Need team to review and classify label Mar 5, 2024

revans2 mentioned this issue Mar 13, 2024

[FEA] Fix GetJsonObject #10254

Open

15 tasks

thirtiseven mentioned this issue Mar 25, 2024

Use new jni kernel for getJsonObject #10581

Merged

thirtiseven closed this as completed in #10581 Mar 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] GetJsonObject throws exception when json path contains a name starting with `'` #10537

[BUG] GetJsonObject throws exception when json path contains a name starting with `'` #10537

thirtiseven commented Mar 4, 2024 •

edited

Loading

thirtiseven commented Mar 4, 2024

res-life commented Mar 19, 2024

[BUG] GetJsonObject throws exception when json path contains a name starting with ' #10537

[BUG] GetJsonObject throws exception when json path contains a name starting with ' #10537

Comments

thirtiseven commented Mar 4, 2024 • edited Loading

thirtiseven commented Mar 4, 2024

res-life commented Mar 19, 2024

[BUG] GetJsonObject throws exception when json path contains a name starting with `'` #10537

[BUG] GetJsonObject throws exception when json path contains a name starting with `'` #10537

thirtiseven commented Mar 4, 2024 •

edited

Loading