fix: make add_piece taking less time #1707

vmx · 2023-05-24T13:30:32Z

add_piece operates in 64 bytes pieces. It's faster to operate in bigger chunks. This commits increases the buffer size to 4KiB. This makes adding a 32GiB piece about more than 2x faster. On the hardware I used it goes down from 14min to about 6min.

`add_piece` operates in 64 bytes pieces. It's faster to operate in bigger chunks. This commits increases the buffer size to 4KiB. This makes adding a 32GiB piece about more than 2x faster. On the hardware I used it goes down from 14min to about 6min.

Kubuxu · 2023-05-24T17:28:00Z

filecoin-proofs/src/commitment_reader.rs

@@ -7,11 +7,13 @@ use rayon::prelude::{ParallelIterator, ParallelSlice};

 use crate::{constants::DefaultPieceHasher, pieces::piece_hash};

+const BUFFER_SIZE: usize = 4096;


Have you tried bumping it even further, for example, up to 1MiB-8MiB?
Even with the 4KiB, it is just 64 SHA invocations, the branch predictor is probably just warming up at that point.

I've tried 8MiB it didn't make a difference.

I think yet another difference would make it if the hashing would be done in parallel. But a simple "par chunker" didn't really help, as then the pieces are to small. One need to implement it manually. I decided that it's not worth it and that this improvement is already good enough for how simple it is.

cryptonemo · 2023-05-24T18:28:13Z

Can we check if lotus is using this method? I recall it not being used, but may be wrong there. In any case, it should help speed up our big tests.

vmx · 2023-05-25T07:38:54Z

Can we check if lotus is using this method?

write_with_alignment seems to be the only API call in the FFI using it. It is then called WriteWithAlignment on the Go side. If I grep the Lotus repo for that, I only find it used in tests.

Why is that important? It's not a breaking change, it just improves things. So if Lotus would use it, it would also be beneficial for them.

cryptonemo · 2023-05-25T12:25:05Z

Can we check if lotus is using this method?

write_with_alignment seems to be the only API call in the FFI using it. It is then called WriteWithAlignment on the Go side. If I grep the Lotus repo for that, I only find it used in tests.

Why is that important? It's not a breaking change, it just improves things. So if Lotus would use it, it would also be beneficial for them.

I just wanted to know if we'd see the speed-ups outside of tests.

vmx requested review from cryptonemo and DrPeterVanNostrand as code owners May 24, 2023 13:30

Kubuxu reviewed May 24, 2023

View reviewed changes

cryptonemo approved these changes May 30, 2023

View reviewed changes

vmx merged commit a2b676c into master May 30, 2023

vmx deleted the improve-add-piece branch May 30, 2023 16:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: make add_piece taking less time #1707

fix: make add_piece taking less time #1707

vmx commented May 24, 2023

Kubuxu May 24, 2023

vmx May 24, 2023

cryptonemo commented May 24, 2023

vmx commented May 25, 2023

cryptonemo commented May 25, 2023

		@@ -7,11 +7,13 @@ use rayon::prelude::{ParallelIterator, ParallelSlice};

		use crate::{constants::DefaultPieceHasher, pieces::piece_hash};

		const BUFFER_SIZE: usize = 4096;

fix: make add_piece taking less time #1707

fix: make add_piece taking less time #1707

Conversation

vmx commented May 24, 2023

Kubuxu May 24, 2023

Choose a reason for hiding this comment

vmx May 24, 2023

Choose a reason for hiding this comment

cryptonemo commented May 24, 2023

vmx commented May 25, 2023

cryptonemo commented May 25, 2023