Concatenate small input chunks before P2P rechunking #8832

hendrikmakait · 2024-08-14T07:27:35Z

~~Blocked by (and includes) #8831~~

Tests added / passed
Passes pre-commit run --all-files

github-actions · 2024-08-14T08:19:07Z

Unit Test Results

See test report for an extended history of previous test failures. This is useful for diagnosing flaky tests.

25 files ±0 · 25 suites ±0 · 10h 9m 26s ⏱️ -10m 16s
4 118 tests +6: 4 000 ✅ +5, 111 💤 -2, 7 ❌ +3
47 571 runs +60: 45 457 ✅ +81, 2 106 💤 -25, 8 ❌ +4

For more details on these failures, see this check.

Results for commit 5487c23. ± Comparison against base commit e9d8233.

This pull request removes 8 and adds 14 tests. Note that renamed tests count towards both.

distributed.tests.test_client ‑ test_client_connectionpool_semaphore_loop
distributed.tests.test_client ‑ test_client_gather_semaphore_loop
distributed.tests.test_utils_test ‑ test_dump_cluster_state
distributed.tests.test_utils_test ‑ test_dump_cluster_state_nannies
distributed.tests.test_utils_test ‑ test_dump_cluster_state_no_workers
distributed.tests.test_utils_test ‑ test_dump_cluster_state_timeout
distributed.tests.test_utils_test ‑ test_dump_cluster_state_unresponsive_local_worker
distributed.tests.test_utils_test ‑ test_dump_cluster_unresponsive_remote_worker

distributed.diagnostics.tests.test_install_plugin ‑ test_package_install_on_nanny
distributed.diagnostics.tests.test_install_plugin ‑ test_package_install_on_worker
distributed.shuffle.tests.test_rechunk ‑ test_calculate_prechunking_3d[old0-new0-expected0]
distributed.shuffle.tests.test_rechunk ‑ test_calculate_prechunking_3d[old1-new1-expected1]
distributed.shuffle.tests.test_rechunk ‑ test_calculate_prechunking_3d[old2-new2-expected2]
distributed.shuffle.tests.test_rechunk ‑ test_calculate_prechunking_3d[old3-new3-expected3]
distributed.shuffle.tests.test_rechunk ‑ test_calculate_prechunking_concatenation[1 B-expected0]
distributed.shuffle.tests.test_rechunk ‑ test_calculate_prechunking_concatenation[100 B-expected3]
distributed.shuffle.tests.test_rechunk ‑ test_calculate_prechunking_concatenation[20 B-expected1]
distributed.shuffle.tests.test_rechunk ‑ test_calculate_prechunking_concatenation[40 B-expected2]
…

♻️ This comment has been updated with latest results.

hendrikmakait · 2024-08-19T07:55:04Z

A/B tests performed on coiled/benchmarks#1532 show a significant improvement in runtime for tests with small input chunks:

As expected, we also see a significant increase in memory usage for those tests:

The memory usage is still within very safe bounds.

Interestingly, we also see an increase in tiles_to_rows, once again within safe bounds:


phofl · 2024-08-22T12:17:41Z

> The memory usage is still within very safe bounds.

Could you quantify this a little?

hendrikmakait · 2024-08-22T17:11:26Z

> > The memory usage is still within very safe bounds.
>
> Could you quantify this a little?

Average memory increases by 10%-30% of the original value.

hendrikmakait · 2024-08-23T06:23:01Z

@phofl: As discussed offline, I've extended the documentation. LMK if anything is still unclear.

phofl

A few very small comments, but LGTM now!

Thanks for the comments, that was very helpful.

phofl · 2024-08-23T09:07:51Z

distributed/shuffle/tests/test_rechunk.py

+        (
+            ((2, 2), (1, 1, 1, 1), (1, 1, 1, 1)),
+            ((1, 1, 1, 1), (2, 2), (2, 2)),
+            ((2, 2), (2, 2), (2, 2)),


This one worries me a little bit. The max input chunk is 2 and the max output chunk is 4, but the algorithm concatenates in a way that we end up with 8, which is not great.

Is the block size limit the upper bound here?

Yes, https://github.com/dask/distributed/pull/8832/files#diff-0b80e83452ff3472b265026d4516846014500b991e12f3de4a41b39a990afbc6R494 is the limit here, so in this case it's 8 because array.chunk-size is 16 B.
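
The numbers in this exchange can be checked with a short sketch. It assumes the three tuples in the diff above are, in order, the old chunks, the new chunks, and the expected prechunking, which is consistent with the sizes quoted in the comment. The helper name max_chunk_elements is hypothetical and not part of distributed; it counts elements, not bytes.

```python
from math import prod

def max_chunk_elements(chunks):
    """Largest chunk in elements: product of the widest chunk along each axis."""
    return prod(max(axis) for axis in chunks)

# Tuples copied from the test case above (interpretation assumed, see lead-in).
old = ((2, 2), (1, 1, 1, 1), (1, 1, 1, 1))
new = ((1, 1, 1, 1), (2, 2), (2, 2))
prechunked = ((2, 2), (2, 2), (2, 2))

sizes = {
    "old": max_chunk_elements(old),
    "new": max_chunk_elements(new),
    "prechunked": max_chunk_elements(prechunked),
}
print(sizes)  # {'old': 2, 'new': 4, 'prechunked': 8}
```

The prechunked blocks are larger than both the input and the output chunks, which is the concern raised above; the reply states that the configured array.chunk-size is what bounds them.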


phofl · 2024-08-23T09:22:28Z

distributed/shuffle/_rechunk.py

+    # by trying dimensions in decreasing value / weight order.
+    def key(k: int) -> float:
+        gse = graph_size_effect[k]
+        bse = block_size_effect[k]


Can you add a sentence on what these two variables represent when you define them above? It took me a bit to figure this out.
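
For readers unfamiliar with the "decreasing value / weight order" mentioned in the code comment, here is a generic sketch of that greedy heuristic. It is not the code from _rechunk.py: the excerpt does not show how graph_size_effect and block_size_effect are defined or combined, so the sketch uses plain hypothetical values, weights, and an additive capacity purely for illustration.

```python
def greedy_by_ratio(values, weights, capacity):
    """Try keys in decreasing value/weight order, keeping each one that still fits."""
    order = sorted(values, key=lambda k: values[k] / weights[k], reverse=True)
    chosen, used = [], 0.0
    for k in order:
        if used + weights[k] <= capacity:
            chosen.append(k)
            used += weights[k]
    return chosen

# Hypothetical per-dimension benefit (value) and cost (weight); not taken from the PR.
values = {0: 6.0, 1: 10.0, 2: 12.0}
weights = {0: 1.0, 1: 2.0, 2: 3.0}
picked = greedy_by_ratio(values, weights, capacity=5.0)
print(picked)  # [0, 1]
```

Dimension 2 has the largest value but the worst ratio, so it is tried last and no longer fits within the capacity.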


hendrikmakait added 4 commits August 13, 2024 22:23

Use task-based rechunking to prechunk along partial boundaries

6fd92b8

Move slicing_is_necessary

5b85a65

Add tests

dac50a9

Minor

d8a9c8d

hendrikmakait requested a review from fjetter as a code owner August 14, 2024 07:27

hendrikmakait marked this pull request as draft August 14, 2024 07:27

hendrikmakait added 3 commits August 15, 2024 10:44

REVERT ME: Adjust CI repository

c23fd8f

Minor

81760fa

Concatenate using task-based shuffling before P2P

6f7a107

hendrikmakait force-pushed the use-rechunking-to-prerechunk branch from f709b3c to 6f7a107 Compare August 15, 2024 08:51

Merge branch 'main' into use-rechunking-to-prerechunk

2ae27fd

hendrikmakait added 6 commits August 20, 2024 15:47

Merge branch 'main' into use-rechunking-to-prerechunk

1b7941d

Revert "REVERT ME: Adjust CI repository"

d269e03

This reverts commit c23fd8f.

Fix merge (WIP)

24bafbe

Fix merge (2/2)

1db2284

Fix concat logic

100fa43

Adjust tests

8035a4a

hendrikmakait marked this pull request as ready for review August 20, 2024 16:21

hendrikmakait added 2 commits August 21, 2024 09:48

Add tests

2c79a6e

Add test

73dfd99

hendrikmakait added 2 commits August 22, 2024 18:13

Add tests and fix ordering

3a119a5

Adopt algorithm from task-based rechunking

54f234e

hendrikmakait requested a review from phofl August 22, 2024 17:12

formatting

e4fdfdb

phofl reviewed Aug 23, 2024


hendrikmakait and others added 3 commits August 23, 2024 13:20

Update distributed/shuffle/_rechunk.py

34f6ee1

Co-authored-by: Patrick Hoefler <61934744+phofl@users.noreply.github.com>

Minor

d86ae35

Add docs

5487c23

hendrikmakait requested a review from phofl August 23, 2024 11:29

phofl approved these changes Aug 23, 2024


hendrikmakait merged commit ea7d35c into dask:main Aug 23, 2024
24 of 32 checks passed

hendrikmakait deleted the use-rechunking-to-prerechunk branch August 23, 2024 12:49
