Implement getShuffleRDD and fixup mismatched output types on shuffle reuse [databricks] #4257

jlowe · 2021-12-01T21:46:41Z

This implements the getShuffleRDD interface for GpuShuffleExchangeExec and fixes an issue with ReusedExchangeExec caching CPU aggregation buffer types before they're updated with GPU aggregation buffer types. The plan is searched for instances where the ReusedShuffleExec output types do not match the underlying GPU shuffle output types, fixing the output types of ReusedShuffleExec to match the GPU shuffle.

…reuse Signed-off-by: Jason Lowe <jlowe@nvidia.com>

jlowe · 2021-12-01T21:46:50Z

build

jlowe · 2021-12-02T02:28:20Z

Databricks 9.1 build failed due to access restrictions on newReuseInstance. Added a shim for this and my best guess at a workaround for it.

jlowe · 2021-12-02T02:28:26Z

build

gerashegalov

LGTM

gerashegalov · 2021-12-02T08:35:56Z

sql-plugin/src/main/301+-nondb/scala/com/nvidia/spark/rapids/shims/v2/AQEUtils.scala

+import org.apache.spark.sql.execution.adaptive.{QueryStageExec, ShuffleQueryStageExec}
+
+/** Utility methods for manipulating Catalyst classes involved in Adaptive Query Execution */
+object AQEUtils {


same as the db version? hopefully this will be combined after the commonizing PR #4235

I don't think #4235 is going to address this. This change is only needed by spark312db, and there's no source directory for "everything except spark312db." I didn't bother to create one for this since the code is so small.

…reuse [databricks] (NVIDIA#4257) * Implement getShuffleRDD and fixup mismatched output types on shuffle reuse Signed-off-by: Jason Lowe <jlowe@nvidia.com> * Fix Databricks 9.1 build

Implement getShuffleRDD and fixup mismatched output types on shuffle …

60226ed

…reuse Signed-off-by: Jason Lowe <jlowe@nvidia.com>

jlowe self-assigned this Dec 1, 2021

jlowe linked an issue Dec 1, 2021 that may be closed by this pull request

[BUG] AQE Crashing Spark RAPIDS when using filter() and union() #4216

Closed

revans2 previously approved these changes Dec 1, 2021

View reviewed changes

abellina previously approved these changes Dec 1, 2021

View reviewed changes

Fix Databricks 9.1 build

1f8bcf6

jlowe dismissed stale reviews from abellina and revans2 via 1f8bcf6 December 2, 2021 02:27

gerashegalov approved these changes Dec 2, 2021

View reviewed changes

revans2 approved these changes Dec 2, 2021

View reviewed changes

jlowe merged commit bad9fee into NVIDIA:branch-21.12 Dec 2, 2021

jlowe deleted the fix-shufflerdd-partitionspec branch December 2, 2021 14:55

This was referenced Dec 2, 2021

[BUG] AQE Crashing Spark RAPIDS when using filter() and union() #4216

Closed

Commonize v2 shim [databricks] #4235

Merged

sameerz added the bug Something isn't working label Dec 3, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement getShuffleRDD and fixup mismatched output types on shuffle reuse [databricks] #4257

Implement getShuffleRDD and fixup mismatched output types on shuffle reuse [databricks] #4257

jlowe commented Dec 1, 2021

jlowe commented Dec 1, 2021

jlowe commented Dec 2, 2021

jlowe commented Dec 2, 2021

gerashegalov left a comment

gerashegalov Dec 2, 2021

jlowe Dec 2, 2021

Implement getShuffleRDD and fixup mismatched output types on shuffle reuse [databricks] #4257

Implement getShuffleRDD and fixup mismatched output types on shuffle reuse [databricks] #4257

Conversation

jlowe commented Dec 1, 2021

jlowe commented Dec 1, 2021

jlowe commented Dec 2, 2021

jlowe commented Dec 2, 2021

gerashegalov left a comment

Choose a reason for hiding this comment

gerashegalov Dec 2, 2021

Choose a reason for hiding this comment

jlowe Dec 2, 2021

Choose a reason for hiding this comment