Make the sync marker uniform for the Avro coalescing reader #5428

firestarman · 2022-05-05T06:56:29Z

This PR is to fix the issue #5312 by appending the same sync marker to every block that being coalesced into a single Avro file.

It has mainly

introduced a new class named BatchContext, and a new method createBatchContext in the parent class of the coalescing reader. This new method returns a BatchContext who lives during the whole stage of building a coalesced memory file. By this design, the merged Avro header can be shared with the steps of estimating the output size, writing file header and building the block data for creating a batch.
updated the copyBlocksData and GPU Avro coalescing reader to support writing a given sync marker.
added the related tests.

Performance on Local

CPU 12 cores, and one GPU (Titan V, with 12GB memory)
Non-partitioned 2000 avro files, 4.4GB in total in LOCAL storage

CPU PERFILE COALESCING (non-uniform-sync) COALESCING (uniform-sync)

time(sec) 27.844 24.758 16.005 15.713

The numbers above show this change will not lead to any perf regression for the coalescing reader.

closes #5312

Signed-off-by: Firestarman firestarmanllc@gmail.com

Signed-off-by: Firestarman <firestarmanllc@gmail.com>

firestarman · 2022-05-05T07:03:59Z

build

jlowe · 2022-05-05T15:40:23Z

I'm confused on the reported performance results. Shouldn't this be reporting the performance of the GPU coalescing reader before/after this change rather than comparing the performance against the CPU? Seems like we're changing way too many variables between the two setups to isolate the performance impact of any single change (i.e.: changing CPU to GPU, non-coalescing to coalescing, uniform sync vs. non-uniform sync, etc.).

sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuMultiFileReader.scala

Signed-off-by: Firestarman <firestarmanllc@gmail.com>

firestarman · 2022-05-06T01:15:05Z

I'm confused on the reported performance results. Shouldn't this be reporting the performance of the GPU coalescing reader before/after this change rather than comparing the performance against the CPU? Seems like we're changing way too many variables between the two setups to isolate the performance impact of any single change (i.e.: changing CPU to GPU, non-coalescing to coalescing, uniform sync vs. non-uniform sync, etc.).

I simply thought it could show us even with this change, we still get better perf than CPU.
Updated to align with the reports of PR #5306, plus an additional column to show the perf with this change.

firestarman · 2022-05-06T01:32:28Z

build

Make the sync marker uniform for coalescing reader

c77984f

Signed-off-by: Firestarman <firestarmanllc@gmail.com>

firestarman requested review from wbo4958, jlowe, GaryShen2008 and tgravescs May 5, 2022 06:58

firestarman added the bug Something isn't working label May 5, 2022

jlowe reviewed May 5, 2022

View reviewed changes

sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuMultiFileReader.scala Outdated Show resolved Hide resolved

sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuMultiFileReader.scala Outdated Show resolved Hide resolved

address the comments

da8d68b

Signed-off-by: Firestarman <firestarmanllc@gmail.com>

firestarman requested a review from jlowe May 6, 2022 01:29

jlowe approved these changes May 6, 2022

View reviewed changes

firestarman merged commit aa3dbab into NVIDIA:branch-22.06 May 7, 2022

firestarman deleted the coal-sync branch May 7, 2022 01:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make the sync marker uniform for the Avro coalescing reader #5428

Make the sync marker uniform for the Avro coalescing reader #5428

firestarman commented May 5, 2022 •

edited

Loading

firestarman commented May 5, 2022

jlowe commented May 5, 2022

firestarman commented May 6, 2022 •

edited

Loading

firestarman commented May 6, 2022

Make the sync marker uniform for the Avro coalescing reader #5428

Make the sync marker uniform for the Avro coalescing reader #5428

Conversation

firestarman commented May 5, 2022 • edited Loading

Performance on Local

firestarman commented May 5, 2022

jlowe commented May 5, 2022

firestarman commented May 6, 2022 • edited Loading

firestarman commented May 6, 2022

firestarman commented May 5, 2022 •

edited

Loading

firestarman commented May 6, 2022 •

edited

Loading