[FEA] Find and un-nest withResource where appropriate #6758

abellina · 2022-10-11T17:42:48Z

We have a nice helper function withResource that has helped us emulate "try-with-resources" semantics from Java in Scala.

This can be abused, however. Here's an example:

withResource(iter.next()) { batch => 
  withResource(GpuColumnVector.from(batch)) { table => 
    withResource(table.callCudfFunction()) { table2 => 
      withResource(table2.callAnotherCudfFunction()) { table3 => 
      }
    }
  }
}

In this code we first transform batch into table, this doesn't incur new GPU memory, but table.callCudfFunction() returns a new table, table2, and so when we call the second cuDF function on table2 we now have in GPU memory: table, table2 and now table3.

Ideally the code above can be broken down into multiple chunks. Note that this isn't possible or clean in all cases, so this isn't an easy task.

val table = withResource(iter.next()) { batch => 
  GpuColumnVector.from(batch)
} 

val table2 = withResource(table) { _ =>
  table.callCudfFunction()
}

// we now only have `table2` in memory, instead of `table` and `table2` prior to making `table3`.

val table3 = withResource(table2) { _ =>
  table2.callAnotherCudfFunction()
}

First work on this issue should probably be to find big parts of the code where we want to focus, perhaps with some smarts to detect the issue. I could see several PRs being generated, per exec that suffers from the above, as we go through the code.

The text was updated successfully, but these errors were encountered:

Fixes NVIDIA#6758 after inspecting the result of a multiline regex search ``` (.*withResource.*[\n]){4,} ``` under modules src/main Signed-off-by: Gera Shegalov <gera@apache.org>

Contributes to #6758 after inspecting the result of a multiline regex search ``` (.*withResource.*[\n]){4,} ``` under modules src/main Signed-off-by: Gera Shegalov <gera@apache.org>

abellina · 2022-12-09T15:09:32Z

The original intent of this issue was to figure out how to instrument withResource to figure out memory growth, but we made inroads into memory waste by looking at the nested level, and by brute force running benchmarks with larger scales and less resources. That said, I think we should move towards measuring memory explosion using the Rmm apis for scoped maximum usage added in 22.12 via #6745, and enforce using #7257

abellina added feature request New feature or request ? - Needs Triage Need team to review and classify reliability Features to improve reliability or bugs that severly impact the reliability of the plugin labels Oct 11, 2022

abellina mentioned this issue Oct 11, 2022

[TASK] Run without fatal OOMs #6746

Closed

10 tasks

sameerz removed feature request New feature or request ? - Needs Triage Need team to review and classify labels Oct 11, 2022

mattahrens assigned abellina and gerashegalov Oct 14, 2022

gerashegalov mentioned this issue Oct 18, 2022

Flatten simple 4+ nesting of withResource #6833

Merged

This was referenced Oct 19, 2022

Reduce memory usage in aggregate.scala #6859

Merged

[BUG] GpuPartitioning should close CVs before releasing semaphore #6913

Merged

abellina mentioned this issue Oct 27, 2022

mergeSort late batch materialization and free already merged batches eagerly #6931

Merged

abellina closed this as completed Dec 9, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEA] Find and un-nest withResource where appropriate #6758

[FEA] Find and un-nest withResource where appropriate #6758

abellina commented Oct 11, 2022 •

edited

Loading

abellina commented Dec 9, 2022

[FEA] Find and un-nest withResource where appropriate #6758

[FEA] Find and un-nest withResource where appropriate #6758

Comments

abellina commented Oct 11, 2022 • edited Loading

abellina commented Dec 9, 2022

abellina commented Oct 11, 2022 •

edited

Loading