[FEA] Support GpuHashAggregateExec in LoRe #10942
Do you have any plans on how to support this? Project is fairly simple because it has a one-to-one relationship between input and output batches. Technically it could be one-to-many if we run out of memory and split an input batch to make it work. How would this work for hash aggregate, which can have a many-to-many relationship? Even triggering on how long a batch took to run feels problematic, because we might not see any slowness until after the first batch, which would then require us to keep all input batches around, or save them out, on the chance that they might be needed to reproduce the problem.
@revans2 Sorry for my late reply. Internally we had a discussion around this with @binmahone @res-life @liurenjie1024 @GaryShen2008. As we discussed offline, we will change the granularity from batch to task level, so it should work for stateful operators like aggregation. Also, regarding the dump timing, we're considering introducing two other modes: i. exact ID matching via task ID or split ID; ii. dumping the first few tasks. The latter can help the non-tailing case. For this part, let's explore whether we can be consistent with @jlowe's profiler tool. @liurenjie1024 will help on that later, and @binmahone is exploring options for the ID-matching approach.
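As a rough sketch of the two proposed trigger modes (in Java for illustration; every class and method name here is hypothetical and does not exist in spark-rapids):

```java
import java.util.Set;
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical sketch of the two dump-trigger modes discussed above.
// Names are illustrative only, not the plugin's actual API.
interface DumpTrigger {
    boolean shouldDump(long taskId);
}

// Mode i: dump only when the task ID matches an explicit target set.
final class ExactIdTrigger implements DumpTrigger {
    private final Set<Long> targetTaskIds;

    ExactIdTrigger(Set<Long> targetTaskIds) {
        this.targetTaskIds = targetTaskIds;
    }

    @Override
    public boolean shouldDump(long taskId) {
        return targetTaskIds.contains(taskId);
    }
}

// Mode ii: dump the first N tasks seen, which helps the non-tailing case
// where the problem shows up early in the run.
final class FirstNTrigger implements DumpTrigger {
    private final AtomicLong seen = new AtomicLong();
    private final long n;

    FirstNTrigger(long n) {
        this.n = n;
    }

    @Override
    public boolean shouldDump(long taskId) {
        return seen.incrementAndGet() <= n;
    }
}
```

The design choice here is that a trigger decides per task, not per batch, matching the task-level granularity described above.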
hi @revans2 In the new LORE implementation we'll use two IDs to uniquely identify the lifespan of a specific operator in a specific task: a LORE ID, which identifies the operator node in the SQL plan, and a partition ID, which identifies the task.
Consider a case where we have skew in e.g. JoinExec: the skewed task will exhibit a consistent LORE ID + Partition ID across different runs of the same SQL (even in the same Spark session). With this design, users can dump only the data related to the problematic operator in a specific task, and we can replay that specific operator locally in a single thread. The LORE ID + Partition ID design can also be extended to enable self-contained profiling (#10870). Currently, #10870 can be enabled based on a time range/job range/stage range; however, job range and stage range should be considered unstable and may result in unexpected traces being dumped. With LORE ID + Partition ID we can express which traces we need more specifically and accurately (LORE ID + Partition ID can uniquely identify which task on which executor). This is how we see it; what do you think @revans2 @jlowe @GaryShen2008? @winningsix @liurenjie1024 please feel free to add your inputs.
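The identification scheme above amounts to a composite key. A minimal sketch in Java (all names here, including `LoreReplayKey` and `dumpPath`, are hypothetical and not the plugin's actual API):

```java
import java.util.Objects;

// Hypothetical composite key identifying one operator instance in one task.
final class LoreReplayKey {
    final int loreId;       // identifies the operator node in the SQL plan
    final int partitionId;  // identifies the task's partition

    LoreReplayKey(int loreId, int partitionId) {
        this.loreId = loreId;
        this.partitionId = partitionId;
    }

    // Illustrative dump location for the batches this operator consumed in
    // this task, so a local single-threaded replay can re-feed them in order.
    String dumpPath(String rootDir) {
        return rootDir + "/lore-" + loreId + "/part-" + partitionId;
    }

    @Override
    public boolean equals(Object o) {
        if (!(o instanceof LoreReplayKey)) return false;
        LoreReplayKey k = (LoreReplayKey) o;
        return loreId == k.loreId && partitionId == k.partitionId;
    }

    @Override
    public int hashCode() {
        return Objects.hash(loreId, partitionId);
    }
}
```

Because the key is stable across runs of the same SQL, a skewed task dumped in one run can be matched to the same operator/task pair in a later run.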
Is your feature request related to a problem? Please describe.
Support aggregation in LORE, both partial and final.
spark-rapids/sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuAggregateExec.scala
Lines 1711 to 1715 in bb05b17