
in mem storage: got rid of the namespace indirection #344

Merged: 2 commits merged into main from inmem_single_lookup on Sep 18, 2024

Conversation

alexsnaps
Member

[screenshot: benchmark results]

Looks better in this screenshot, but I will flamegraph and test this a little further on Linux

@alexsnaps changed the title from "Got rid of the namespace indirection" to "in mem storage: got rid of the namespace indirection" on May 23, 2024
Contributor

@chirino chirino left a comment


lgtm

@alexsnaps
Member Author

alexsnaps commented May 27, 2024

[screenshot: benchmark results]

Some further testing, with no `variables`, i.e. actually hitting the change! Not that much of a difference... With the newly added bench scenario: #346

But if I grow the namespaces to 1000...

[screenshot: benchmark results]

Then it's a regression!

Now, that's all still on macOS; I'll test these more rigorously on Linux next...

@alexsnaps
Member Author

alexsnaps commented May 27, 2024

Also... if a "tree" (namespace 1 -> * Limits -> 1 Counter) seems to be faster... how would storing these in a BTreeMap work out... 🤔

1000 namespaces with 50 limits each:

[screenshot: benchmark results]

10 namespaces with 50 limits each:

[screenshot: benchmark results]

Well... much better. Anyways, I'm going to stop here, run all of these on Linux, and flamegraph some of them, but some of this would absolutely apply to our distributed storage /cc @chirino
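For context on what "storing these in a BTreeMap" would mean, here is a minimal sketch of the two lookup shapes being weighed, using stand-in types rather than Limitador's actual ones (the `Limit` below is deliberately simplified):

```rust
use std::collections::{BTreeMap, HashMap};

// Stand-in for the real Limit; only enough fields to key a map.
#[derive(PartialEq, Eq, Hash, PartialOrd, Ord)]
struct Limit {
    namespace: String,
    conditions: Vec<String>,
}

struct Counter(u64);

// Before: namespace indirection, i.e. two lookups per counter access.
fn get_nested<'a>(
    store: &'a HashMap<String, HashMap<Limit, Counter>>,
    limit: &Limit,
) -> Option<&'a Counter> {
    store.get(&limit.namespace)?.get(limit)
}

// After: a single ordered map keyed by the Limit itself.
fn get_flat<'a>(store: &'a BTreeMap<Limit, Counter>, limit: &Limit) -> Option<&'a Counter> {
    store.get(limit)
}
```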

@alexsnaps alexsnaps changed the base branch from main to bench May 27, 2024 13:49
pub struct InMemoryStorage {
    limits_for_namespace: RwLock<NamespacedLimitCounters<AtomicExpiringValue>>,
    simple_limits: RwLock<HashMap<Limit, AtomicExpiringValue>>,
Contributor


❤️
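A rough sketch of what dropping the per-namespace map buys on the hot path, using stand-in types (the real `AtomicExpiringValue` and `Limit` live in Limitador; this only shows the shape): one `RwLock` read acquisition and one map lookup per counter access, with the value updated atomically in place.

```rust
use std::collections::HashMap;
use std::sync::atomic::{AtomicU64, Ordering};
use std::sync::RwLock;

#[derive(PartialEq, Eq, Hash)]
struct Limit {
    namespace: String,
    max_value: u64,
}

// Stand-in for AtomicExpiringValue: interior mutability, no write lock needed.
#[derive(Default)]
struct ExpiringValue(AtomicU64);

struct InMemoryStorage {
    simple_limits: RwLock<HashMap<Limit, ExpiringValue>>,
}

impl InMemoryStorage {
    fn incr(&self, limit: &Limit, delta: u64) -> Option<u64> {
        // A read lock suffices: the counter itself is updated atomically.
        let limits = self.simple_limits.read().unwrap();
        limits
            .get(limit)
            .map(|v| v.0.fetch_add(delta, Ordering::SeqCst) + delta)
    }
}
```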

use std::ops::Deref;
use std::sync::{Arc, RwLock};
use std::time::{Duration, SystemTime};

pub struct InMemoryStorage {
-    simple_limits: RwLock<HashMap<Limit, AtomicExpiringValue>>,
+    simple_limits: RwLock<BTreeMap<Limit, AtomicExpiringValue>>,
Contributor


👍🏼
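The `BTreeMap` key needs a total order, which is what "having the Limits comparable" below refers to. A hedged sketch of what that could look like, with purely illustrative fields (not Limitador's actual `Limit` definition): a derived `Ord` compares fields in declaration order, so lookups can often short-circuit on the namespace instead of hashing the entire Limit.

```rust
use std::collections::{BTreeMap, BTreeSet};

// Illustrative only: field names and types are assumptions, not the real Limit.
#[derive(PartialEq, Eq, PartialOrd, Ord)]
struct Limit {
    namespace: String,
    conditions: BTreeSet<String>,
    variables: BTreeSet<String>,
}

fn main() {
    let mut counters: BTreeMap<Limit, u64> = BTreeMap::new();
    counters.insert(
        Limit {
            namespace: "example".into(),
            conditions: BTreeSet::from(["req.method == 'GET'".to_string()]),
            variables: BTreeSet::new(),
        },
        0,
    );
    assert_eq!(counters.len(), 1);
}
```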

@alexsnaps
Member Author

alexsnaps commented May 27, 2024

Did some more and, by now, relatively extensive testing... making the Limits comparable and storing the AtomicExpiringValue in a BTreeMap gives the best results in most cases. When you have Limits with many (in the 100s of) Conditions, performance degrades. Still need to look into that further... But here is the callstack for Hash based vs Cmp based lookups when searching for the Counters:

Hash based: HashMap<Namespace, HashMap<Limit, AtomicExpiringValue>>

[screenshot: call stack, 2024-05-27 16-21-23]

Cmp based: BTreeMap<Limit, AtomicExpiringValue>

[screenshot: call stack, 2024-05-27 16-23-13]

Giving us this kind of result:

Memory/check_rate_limited_and_update/10 namespaces with 50 limits each with 10 conditions and 0 vari...
                        time:   [253.02 µs 254.21 µs 255.86 µs]
                        change: [-9.1814% -8.4315% -7.7729%] (p = 0.00 < 0.05)
                        Performance has improved.

[screenshot: benchmark results, 2024-05-27 16-34-04]
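Not the actual scenario added in #346, but for readers unfamiliar with the criterion output above, a self-contained sketch of how such a lookup microbenchmark could be shaped (names and sizes are illustrative):

```rust
use std::collections::BTreeMap;
use std::hint::black_box;

use criterion::{criterion_group, criterion_main, Criterion};

fn bench_flat_lookup(c: &mut Criterion) {
    // 10 "namespaces" x 50 "limits", flattened into one ordered map.
    let counters: BTreeMap<(String, u32), u64> = (0..10u32)
        .flat_map(|ns| (0..50u32).map(move |l| ((format!("ns-{ns}"), l), 0u64)))
        .collect();

    c.bench_function("flat_btreemap_lookup/10 namespaces with 50 limits each", |b| {
        b.iter(|| counters.get(black_box(&("ns-5".to_string(), 25u32))))
    });
}

criterion_group!(benches, bench_flat_lookup);
criterion_main!(benches);
```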

Zooming out...

Right now, though, the worst offender is not where we've been looking, and that's true with whichever storage version, btw:

[screenshot: call stack, 2024-05-27 16-27-57]

RateLimiter::counters_that_applies spends all its time calling clone() on the Limits (and their composing parts), only to re-clone them again and map them into Counters... I'll have a look at that next.

What's next?

🛑 This cannot be merged! ⚠️

As of today we use the JSON-serialized form of Limits to look counters up in Redis... It is highly unlikely that switching from HashMaps for our Limit fields to BTreeMap would preserve the order we obtained from our own sorting functions.

So we first need to find a backwards-compatible way of doing all of this (i.e. have the crate::storage::keys::* functions keep relying on the previous behavior, while using the new representation in memory)
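One backwards-compatible direction, sketched with purely illustrative names (this is not the actual crate::storage::keys code, and the legacy ordering is only hinted at): keep the key functions sorting and serializing fields exactly as the old HashMap-based code did, independent of how the Limit is now ordered and stored in the in-memory BTreeMap.

```rust
use serde::Serialize;

// Illustrative view of a Limit for key generation; field names are assumptions.
#[derive(Serialize)]
struct LegacyKeyView<'a> {
    namespace: &'a str,
    conditions: Vec<&'a str>,
}

// Hypothetical stand-in for a crate::storage::keys-style function: it keeps
// producing the Redis key the previous representation produced, by sorting
// with the legacy comparator (plain lexicographic here, as a placeholder)
// before serializing to JSON.
fn legacy_counter_key(namespace: &str, conditions: &[String]) -> String {
    let mut conditions: Vec<&str> = conditions.iter().map(String::as_str).collect();
    conditions.sort_unstable();
    serde_json::to_string(&LegacyKeyView { namespace, conditions }).unwrap()
}
```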

Limiter::counters_that_applies

I think we can probably store Arc<Limit> and use these, instead of building a full-blown HashSet<Limit> on each request, which requires cloning all the Limits.
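A minimal sketch of that Arc<Limit> idea, with illustrative types rather than the real RateLimiter internals: the lookup hands back shared pointers into the stored Limits, so each request clones an Arc (a pointer bump) instead of the Limit data itself.

```rust
use std::sync::Arc;

// Illustrative stand-ins; not the actual Limit or limiter types.
#[derive(PartialEq, Eq, Hash)]
struct Limit {
    namespace: String,
    conditions: Vec<String>,
}

struct LimitIndex {
    limits: Vec<Arc<Limit>>,
}

impl LimitIndex {
    // Hypothetical counterpart to counters_that_applies: no Limit is deep-cloned,
    // only the Arc handles are.
    fn limits_that_apply(&self, namespace: &str) -> Vec<Arc<Limit>> {
        self.limits
            .iter()
            .filter(|l| l.namespace == namespace)
            .cloned()
            .collect()
    }
}
```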

Base automatically changed from bench to main May 28, 2024 11:37
@alexsnaps alexsnaps force-pushed the inmem_single_lookup branch 2 times, most recently from 88fc6f8 to d35a266 Compare September 12, 2024 14:32
@alexsnaps
Member Author

> 🛑 This cannot be merged! ⚠️
>
> As of today we use the JSON serialized form of Limits to look counters up in Redis... It is highly unlikely that switching from HashMaps for our Limit field to BTreeMap would preserve the order we obtained in our own sorting functions.

I'll break the format anyways with more PRs coming… which is why we are releasing Limitador v2. With that in mind, maybe we are ok merging this?

@alexsnaps alexsnaps marked this pull request as ready for review September 12, 2024 14:34
Signed-off-by: Alex Snaps <alex@wcgw.dev>
Signed-off-by: Alex Snaps <alex@wcgw.dev>
Contributor

@didierofrivia didierofrivia left a comment


🪜

@alexsnaps alexsnaps merged commit cdb9850 into main Sep 18, 2024
9 checks passed
@alexsnaps alexsnaps deleted the inmem_single_lookup branch September 18, 2024 11:16