Is using Kafka as an input source for Flink a performance bottleneck?

Flink can read from Kafka topics. Is that a performance bottleneck that makes Flink slow overall?

Kafka topics scale horizontally by adding partitions to accommodate higher throughput.

Each Kafka partition is read by only one Flink source subtask at a time, although a single subtask can read several partitions.

So if you have only one Kafka partition and N+1 parallel Flink source subtasks, N of them will sit idle. That could be a bottleneck, sure, but it is a tradeoff of having total ordering within a Kafka topic, not necessarily a Flink problem.

Otherwise, you would create your Kafka topics with tens to hundreds of partitions, and Flink would have no trouble consuming them in parallel.
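
To make the idle-task point concrete, here is a minimal sketch in plain Python that spreads partitions over parallel source subtasks. The round-robin rule and the function names (assign_partitions, idle_subtasks) are assumptions made for illustration; this is not Flink's actual assignment code, only the general idea that a partition has exactly one reader.

```python
def assign_partitions(num_partitions: int, parallelism: int) -> dict[int, list[int]]:
    """Assign each partition to exactly one subtask (assumed round-robin rule)."""
    assignment: dict[int, list[int]] = {subtask: [] for subtask in range(parallelism)}
    for partition in range(num_partitions):
        # Illustrative rule: partition p goes to subtask p % parallelism.
        assignment[partition % parallelism].append(partition)
    return assignment


def idle_subtasks(assignment: dict[int, list[int]]) -> list[int]:
    """Return the subtasks that received no partition and therefore do no reading."""
    return [subtask for subtask, parts in assignment.items() if not parts]


if __name__ == "__main__":
    # One partition, four subtasks: three subtasks have nothing to read.
    print(idle_subtasks(assign_partitions(1, 4)))
    # Twelve partitions, four subtasks: every subtask reads three partitions.
    print(assign_partitions(12, 4))
```

With one partition and a parallelism of four, three subtasks end up with an empty list, which is the N idle tasks case described above. With twelve partitions, every subtask has work.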

It depends. If, for example, you have a slow network connection between your Kafka cluster and your Flink cluster, then that will become a performance bottleneck.
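
As a rough back-of-the-envelope check of that network limit, the sketch below computes an upper bound on records per second from link bandwidth and average record size. The function name and the example numbers (100 Mbit/s, 1000-byte records) are hypothetical, and the estimate deliberately ignores protocol overhead, compression and batching.

```python
def max_records_per_second(link_mbit_per_s: float, avg_record_bytes: int) -> float:
    """Upper bound on ingest rate if the network link is the only limit."""
    bytes_per_second = link_mbit_per_s * 1_000_000 / 8  # megabits -> bytes
    return bytes_per_second / avg_record_bytes


if __name__ == "__main__":
    # Hypothetical link and record size, chosen only for illustration.
    print(max_records_per_second(100, 1000))
```

If the rate you need is above this bound, adding Kafka partitions or Flink parallelism will not help until the link itself is faster.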
