
Scalability of KEDA with Kafka scaler? #601

Closed
alexandery opened this issue Feb 3, 2020 · 14 comments
Labels
clarification-functionality Need clarification on how some part of KEDA works stale All issues that are marked as stale due to inactivity

Comments

@alexandery

alexandery commented Feb 3, 2020

Is there any data on how scalable KEDA with Kafka scaler is? Meaning - how many topics and consumer groups can I safely monitor and use for scaling deployments? I suspect the scalability would be somewhat different for different scalers.

Would I be able to monitor 10,000 topics with average of 3 consumer groups per topic? This would result in 30,000 ScaledObjects.

I was unable to find out whether KEDA runs as a single instance or multiple, distributing the load between instances.

Would appreciate any insight, pointers, suggestions.

Thank you!

P.S. Apologies for "bug" label - I wasn't sure which category to post this under.

@alexandery alexandery added the bug label Feb 3, 2020
@anirudhgarg anirudhgarg added the not-a-bug and clarification-functionality labels and removed the bug and not-a-bug labels Feb 3, 2020
@anirudhgarg
Contributor

Currently KEDA runs in a single pod. There has been some discussion about extending KEDA to work with multiple pods or multiple namespaces, but that hasn't been implemented.

PS: Added a new tag for "clarification-functionality"

@alexandery
Author

@anirudhgarg Thank you.

Is it a fair assumption that any scaler runs as a component within KEDA and as such is subject to the general limitations of the technology it monitors? With Kafka client-side libraries there is definitely a limit to how many partitions one process can work with comfortably (at least with the Confluent ones I've used). Would those same limits apply here?

I think MS owns Kafka scaler - https://keda.sh/scalers/apache-kafka-topic/ . Since you are from MS, do you know who would be able to answer my question?

@alexandery
Author

bump...

@tomkerkhove
Member

We are open to any contributions, regardless of who owns it!

So your concern is that it's not checking all the Kafka partitions, or am I getting your question wrong?

@alexandery
Author

@tomkerkhove My concern is the capacity of the Kafka scaler within KEDA. If I need to configure scaling rules for 500-1000 (or a few thousand) deployments, each based on 1-5 topics in my Kafka cluster (which would result in at least 1-2K topics being actively monitored by KEDA) - would KEDA support that?

I don't know exactly how the Kafka scaler in KEDA monitors topics, but I do know that there is a very real limit on how many Kafka partitions one process can connect to - so if the Kafka scaler is a single process, it won't be able to monitor more than 2500-3000 partitions with the Kafka client libraries I'm familiar with. That is why I was wondering about the capacity aspects of the Kafka scaler.

@zroubalik
Member

zroubalik commented May 5, 2020

@alexandery you can set the KEDA operator to watch only a particular namespace -> this allows you to deploy multiple KEDA instances in the cluster. But there can be only one Metrics Server in the cluster, so it would be shared.
Unfortunately I am not aware of any data regarding your question. If you have the resources, it would be really great if you could try putting such a load on KEDA and share the results with us!

@alexandery
Author

@zroubalik Thanks for your reply. Sorry, was busy with something else and am getting back to KEDA now.

Could you please elaborate a little on single Metrics server and multiple KEDA instances? For a second there I thought I could deploy KEDA into a namespace, but that's clearly not the case.

I'll see what I can do to simulate the scenario I was talking about. I need to know whether we can use KEDA given its limitations.

@zroubalik
Member

@alexandery sorry, I have missed your comment somehow.

You can deploy multiple KEDA Operator instances; you need to set the WATCH_NAMESPACE variable for each deployment, i.e. you can have one operator running in namespace foo and a second in namespace bar, each controlling its specified namespace. Just be careful about the setting: 2 or more operators can't watch the same namespace.
Unfortunately, due to the current k8s API server design, you can't deploy multiple KEDA Metrics Servers (it is the source of the metrics for the HPAs, which perform the scaling itself). So there will be just one such deployment running.
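A minimal sketch of that per-namespace layout, assuming illustrative resource names and image tag (not taken from KEDA's actual manifests or charts):

```yaml
# Operator instance that watches only the "foo" namespace.
# A second deployment in namespace "bar" would set WATCH_NAMESPACE=bar;
# an empty value means "watch all namespaces".
apiVersion: apps/v1
kind: Deployment
metadata:
  name: keda-operator-foo   # illustrative name
  namespace: foo
spec:
  replicas: 1
  selector:
    matchLabels:
      app: keda-operator-foo
  template:
    metadata:
      labels:
        app: keda-operator-foo
    spec:
      containers:
        - name: keda-operator
          image: kedacore/keda:1.3.0   # illustrative tag
          env:
            - name: WATCH_NAMESPACE
              value: foo
```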

The real bottleneck could then be the Metrics Server, but I don't have any specific data on this. If you are planning to do some load/performance testing, it would be great to share the results. An interesting scenario would be to compare the scaling of a large number of ScaledObjects by a single KEDA instance against multiple KEDA operators, to see if/how much the single Metrics Server affects/limits the scaling.

@mariusgrigoriu

Seems like the WATCH_NAMESPACE environment variable is not being honored by the operator.

@zroubalik
Member

How so?

@mariusgrigoriu

While the adapter does honor the variable, the operator does not. Set the variable for the operator to a single namespace and you'll see in the logs that it still operates on all namespaces.

Inspecting the operator's code shows that, unlike the adapter, it does not set the namespace when calling NewManager. We will submit a PR soon.

@tomkerkhove
Member

Thanks!

@stale

stale bot commented Oct 13, 2021

This issue has been automatically marked as stale because it has not had recent activity. It will be closed in 7 days if no further activity occurs. Thank you for your contributions.

@stale stale bot added the stale All issues that are marked as stale due to inactivity label Oct 13, 2021
@stale

stale bot commented Oct 20, 2021

This issue has been automatically closed due to inactivity.

@stale stale bot closed this as completed Oct 20, 2021
SpiritZhou pushed a commit to SpiritZhou/keda that referenced this issue Jul 18, 2023
Signed-off-by: Jorge Turrado <jorge.turrado@docplanner.com>