
[FEA] support spark.sql.legacy.timeParserPolicy #50

Closed

revans2 opened this issue May 29, 2020 · 3 comments
Assignees
andygrove

Labels
feature request (New feature or request), P0 (Must have for release), SQL (part of the SQL/Dataframe plugin)

Comments

revans2 (Collaborator) commented May 29, 2020

Is your feature request related to a problem? Please describe.
When parsing dates and times, it would be good if we could also follow the spark.sql.legacy.timeParserPolicy config.
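
For reference, a spark-shell sketch of what the config controls (Spark 3.x; the value "2020-5-29" and the pattern are illustrative only):

```scala
import org.apache.spark.sql.functions.to_date
import spark.implicits._

// "2020-5-29" parses under the legacy SimpleDateFormat parser but not under
// the new DateTimeFormatter-based parser, since "MM" requires two digits.
val df = Seq("2020-5-29").toDF("s")

spark.conf.set("spark.sql.legacy.timeParserPolicy", "EXCEPTION") // the default
// df.select(to_date($"s", "yyyy-MM-dd")).show() // throws SparkUpgradeException

spark.conf.set("spark.sql.legacy.timeParserPolicy", "CORRECTED")
df.select(to_date($"s", "yyyy-MM-dd")).show() // row is null

spark.conf.set("spark.sql.legacy.timeParserPolicy", "LEGACY")
df.select(to_date($"s", "yyyy-MM-dd")).show() // 2020-05-29
```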

revans2 added the feature request, ? - Needs Triage, and SQL labels May 29, 2020
sameerz (Collaborator) commented Sep 22, 2020

Trace through where this config is used in Spark, and if the plugin cannot match the same functionality, fall back to the CPU.
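
For context, the config surfaces in Spark 3.x through SQLConf as a LegacyBehaviorPolicy value; a minimal sketch of reading it (class locations as of Spark 3.0, worth re-verifying against the target Spark version):

```scala
import org.apache.spark.sql.internal.{LegacyBehaviorPolicy, SQLConf}

// LegacyBehaviorPolicy is an Enumeration with EXCEPTION (the default),
// CORRECTED, and LEGACY. SQLConf.get reads the active session's conf.
val policy: LegacyBehaviorPolicy.Value = SQLConf.get.legacyTimeParserPolicy
val needsCpuFallback = policy == LegacyBehaviorPolicy.LEGACY
```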

sameerz removed the ? - Needs Triage label Sep 22, 2020
sameerz added the P0 label Sep 22, 2020
sameerz added this to the Oct 26 - Nov 6 milestone Oct 23, 2020
andygrove self-assigned this Oct 23, 2020
andygrove (Contributor) commented Nov 11, 2020

The default value for spark.sql.legacy.timeParserPolicy is EXCEPTION, in which case Spark throws an exception if any of the following functions is unable to parse data using the specified pattern, and suggests that the conversion may work with LEGACY. If the config is set to CORRECTED, the conversion returns null instead of throwing an exception.

  • unix_timestamp
  • from_unixtime
  • from_utc_timestamp
  • to_unix_timestamp
  • to_utc_timestamp
  • to_date
  • to_timestamp
  • date_format

I propose that we follow the same behavior but fall back to the CPU under LEGACY for these functions, until we have a reason to add support for specific legacy formats that are no longer supported in Spark 3.0 and later. If we do add such support, we can then fall back to the CPU only for the legacy formats that we do not support.
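
A minimal sketch of that proposal, with a hypothetical willNotWorkOnGpu callback standing in for the plugin's real expression-tagging hook:

```scala
import org.apache.spark.sql.internal.{LegacyBehaviorPolicy, SQLConf}

// Hypothetical check run while tagging a date/time expression such as
// to_date or unix_timestamp for GPU execution.
def tagTimeParserPolicy(willNotWorkOnGpu: String => Unit): Unit =
  SQLConf.get.legacyTimeParserPolicy match {
    case LegacyBehaviorPolicy.LEGACY =>
      // Legacy SimpleDateFormat semantics are not implemented on the GPU,
      // so keep the expression on the CPU.
      willNotWorkOnGpu("spark.sql.legacy.timeParserPolicy=LEGACY is not supported")
    case _ =>
      // EXCEPTION and CORRECTED use the new parser, whose behavior the GPU
      // implementation aims to match, so no fallback is needed here.
      ()
  }
```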

andygrove (Contributor) commented

Resolved by #1113 for functions, and I filed a follow-on, #1111, for handling this for CSV reads.
