"test" + "add" in a single call #68

piskvorky · 2015-09-23T02:43:21Z

In the (common?) scenario of "test if item already in filter, if not, add it; run additional business logic based on whether the element is new or not", we now have the choice of:

if item in bloom_filter:
    bloom_filter.add(item)
    ...code
else:
   ...other code

exists = bloom_filter.add(item)
if exists:
    ...code
else:
   ...other code

There's not much difference in performance (hashing is fast, memory caches primed), though the second approach is nicer. However, it sets the array bits even if they are already set (always writes through). This is unfortunate because the write access is unnecessary for duplicates and makes things more complicated and slower (more IO, cache stalls).

This PR adds a new method, contains_add, which acts just like add but only sets the bits if not already set:

exists = bloom_filter.contains_add(item)
if exists:
    ...code
else:
   ...other code

According to benchmarks, it's also about 11% faster than option 2) above, when expecting 20% duplicates.

axiak · 2015-09-23T02:49:36Z

I would expect option (2) to be faster, not slower. We should be within the same cache and there isn't any real I/O. Still, I can't argue with the numbers -- I suspect the difference is accounted for by extra python interpreter work, rather than just doing it all in C.

axiak · 2015-09-23T02:52:09Z

src/mmapbitarray.h

+        return -1;
+    }
+    DTYPE *chunk = array->vector + (size_t)(array->preamblesize + bit / (sizeof(DTYPE) << 3));
+    int byte = 1 << (bit % (sizeof(DTYPE) << 3));


can you use the _vector_byte static inline function here?

piskvorky · 2015-09-23T02:52:21Z

There is I/O -- processor caches can make a huge difference, and I think any write invalidates it, even when the write is a no-op (no bits changed). Memory writes are expensive.

axiak · 2015-09-23T02:52:59Z

src/mmapbitarray.h

+        errno = EINVAL;
+        return -1;
+    }
+    DTYPE *chunk = array->vector + (size_t)(array->preamblesize + bit / (sizeof(DTYPE) << 3));


and please use the _vector_offset static inline function here as well.

axiak · 2015-09-23T02:54:01Z

If you want to earn brownie points -- try writing a Cython version that calls out to the C test & add and see what the performance improvement is :)

piskvorky · 2015-09-23T03:11:20Z

Done. Let me know if you can replicate the performance improvements, I wonder how other HW / compiler factors play into this (OS X here, Apple LLVM version 7.0.0, MacBookPro11,3).

I merged the 32bit fix as well -- it modifies the same file, and I don't want conflicts later. Weirdly enough, even the pypy test passed now.

piskvorky · 2015-09-23T03:33:53Z

I think you're right -- I checked a version that simply writes instead of the if, and it's faster still.

So, either was just Python overhead that makes this faster, or the branch is even more expensive than a write (which kinda makes sense). Either way, contains_add is consistently faster than add -- maybe worth making it the default?

earonesty · 2016-02-25T19:21:18Z

Should make it the default, if it's typically faster.

add a new test-add method that doesn't write unless it has to

39fac58

piskvorky force-pushed the test_add branch from 39eb3fb to 39fac58 Compare September 23, 2015 02:47

axiak reviewed Sep 23, 2015
View reviewed changes

piskvorky added 2 commits September 23, 2015 11:58

Merge branch 'fix32bit' into test_add

efc71a0

use the inline statics instead of manual inline

a2ba510

write rather than branch

5c015a7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

"test" + "add" in a single call #68

"test" + "add" in a single call #68

piskvorky commented Sep 23, 2015

axiak commented Sep 23, 2015

axiak Sep 23, 2015

piskvorky commented Sep 23, 2015

axiak Sep 23, 2015

axiak commented Sep 23, 2015

piskvorky commented Sep 23, 2015

piskvorky commented Sep 23, 2015

earonesty commented Feb 25, 2016

"test" + "add" in a single call #68

Are you sure you want to change the base?

"test" + "add" in a single call #68

Conversation

piskvorky commented Sep 23, 2015

axiak commented Sep 23, 2015

axiak Sep 23, 2015

Choose a reason for hiding this comment

piskvorky commented Sep 23, 2015

axiak Sep 23, 2015

Choose a reason for hiding this comment

axiak commented Sep 23, 2015

piskvorky commented Sep 23, 2015

piskvorky commented Sep 23, 2015

earonesty commented Feb 25, 2016