简体繁体中英

Apache Flink State Store vs Kafka Streams

原文 2019-02-09 16:21:38 0 2 apache-kafka/ apache-flink/ apache-kafka-streams/ stream-processing

As far as I know handles Kafka Streams its States localy in memory or on disc or in a Kafka topic because all the input date is from a partition, where all the messages are keyed by a defined value. Most of the time the computations can be done without knowing the state of other Processors. If so, you have another Streams instance whichs calculsates the result. Like in this picture:

Where exactly does Flink store its States? Can Flink also store the states locally or does it always publish them always to all instances (tasks)? Is it possible to configure Flink so that it stores the States in a Kafka Broker?

2 answers

Flink also uses local stores (that can be keyed), similar to Kafka Streams. However, it does not write state into Kafka topics.

For fault-tolerance, it takes so-called "distributed snapshots", that are stored in a configurable state backend (eg, HDFS).

Check out the docs for more details:

There is a distinction between Flink and Kafka Streams. Flink is cluster framework, your code is deployed and run as job in Flink Cluster. Kafka streams is API that you embed in your standard java application. Stream processing logic runs inside the your application java process. They both can sink results to Kafka, key value store, database or external systems. Flink's master node implements its own high availability mechanism based on ZooKeeper and ensures the availability interim states after the disaster. If you are using Kafka Streams once you managed to save your interim states to Kafka Cluster you will have the same HA features provided by Kafka Cluster.

Kafka Streams vs Flink

Kafka Streams State Store range vs prefixScan

Kafka Consumer Vs Apache Flink

Kafka streams state store distribution

Kafka streams state store for what?

InvalidStateStoreException: the state store is not open in Kafka streams

Creating Global State Store in Kafka Streams (Spring)

Kafka Streams Processor API clear state store

number of partitions for global state store in kafka streams

Kafka Streams: State Store partition error

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question Kafka Streams vs Flink Kafka Streams State Store range vs prefixScan Kafka Consumer Vs Apache Flink Kafka streams state store distribution Kafka streams state store for what? InvalidStateStoreException: the state store is not open in Kafka streams Creating Global State Store in Kafka Streams (Spring) Kafka Streams Processor API clear state store number of partitions for global state store in kafka streams Kafka Streams: State Store partition error

Related Tags

Apache Flink State Store vs Kafka Streams

Question

2 answers

solution1
3 2019-02-09 20:07:11

solution2
0 2021-10-30 12:47:29

Apache Flink State Store vs Kafka Streams

Question

2 answers

solution1 3 2019-02-09 20:07:11

solution2 0 2021-10-30 12:47:29

solution1
3 2019-02-09 20:07:11

solution2
0 2021-10-30 12:47:29