
Kafka as a data store for future events

I have a Kafka cluster that receives messages from a source whenever data changes in that source. In some cases the messages are meant to be processed in the future. So I have two options:

  1. Consume all messages and repost any that are meant for the future to Kafka under a different topic (with the target date in the topic name), and have a Storm topology that subscribes to the topic carrying that day's name. This ensures messages are processed only on the day they are meant for (see the sketch after this list).
  2. Store them in a separate DB and build a scheduler that reads the messages and posts them to Kafka only on that future date.
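For concreteness, here is a minimal sketch of the reposting half of Option 1 in Java, assuming each record carries its target processing date in a header named process-date and that daily topics follow an events-yyyy-MM-dd naming scheme; the header name, topic names, and string serialization are illustrative assumptions, not part of the original setup:

    // Sketch of Option 1: route records that are due in the future onto a
    // date-named topic, so a daily topology only subscribes to today's topic.
    // Assumes a "process-date" header (yyyy-MM-dd) -- an illustrative choice.
    import org.apache.kafka.clients.consumer.*;
    import org.apache.kafka.clients.producer.*;
    import org.apache.kafka.common.header.Header;
    import org.apache.kafka.common.serialization.*;

    import java.nio.charset.StandardCharsets;
    import java.time.Duration;
    import java.time.LocalDate;
    import java.util.List;
    import java.util.Properties;

    public class FutureEventRouter {
        public static void main(String[] args) {
            Properties cProps = new Properties();
            cProps.put("bootstrap.servers", "localhost:9092");
            cProps.put("group.id", "future-event-router");
            cProps.put("key.deserializer", StringDeserializer.class.getName());
            cProps.put("value.deserializer", StringDeserializer.class.getName());

            Properties pProps = new Properties();
            pProps.put("bootstrap.servers", "localhost:9092");
            pProps.put("key.serializer", StringSerializer.class.getName());
            pProps.put("value.serializer", StringSerializer.class.getName());

            try (KafkaConsumer<String, String> consumer = new KafkaConsumer<>(cProps);
                 KafkaProducer<String, String> producer = new KafkaProducer<>(pProps)) {
                consumer.subscribe(List.of("source-events"));
                while (true) {
                    for (ConsumerRecord<String, String> rec : consumer.poll(Duration.ofSeconds(1))) {
                        Header h = rec.headers().lastHeader("process-date");
                        String date = h == null ? null
                                : new String(h.value(), StandardCharsets.UTF_8);
                        if (date != null && LocalDate.parse(date).isAfter(LocalDate.now())) {
                            // Not due yet: park it on the date-named topic,
                            // e.g. events-2024-06-01, for that day's topology.
                            producer.send(new ProducerRecord<>("events-" + date, rec.key(), rec.value()));
                        } else {
                            process(rec); // due today (or undated): handle now
                        }
                    }
                }
            }
        }

        private static void process(ConsumerRecord<String, String> rec) {
            System.out.printf("processing %s -> %s%n", rec.key(), rec.value());
        }
    }

The Storm topology would then subscribe only to events-&lt;today&gt; each day and drain it.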

Option 1 is easier to execute, but my question is: is Kafka a durable data store? Has anyone done this sort of eventing with Kafka? Are there any gaping holes in the design?

You can configure the amount of time your messages stay in Kafka (log.retention.hours).
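For reference, the broker-wide default lives in server.properties, and individual (e.g. date-named) topics can override it with retention.ms; the values below are illustrative, and the exact kafka-configs flags vary a little between Kafka versions:

    # server.properties -- broker-wide default: keep log segments for 7 days
    log.retention.hours=168

    # Per-topic override (e.g. keep a date-named topic ~3 days), set with the
    # stock kafka-configs tool; newer brokers take --bootstrap-server instead
    # of the older --zookeeper flag:
    # bin/kafka-configs.sh --bootstrap-server localhost:9092 --alter \
    #     --entity-type topics --entity-name events-2024-06-01 \
    #     --add-config retention.ms=259200000

Note that for the future-events scheme in the question, retention must be at least as long as the furthest-out scheduled date, or messages will be deleted before they are ever consumed.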

But keep in mind that Kafka is meant to be used as a "real-time buffer" between your producers and your consumers, not as a durable data store. I don't think Kafka+Storm would be the appropriate tool for your use case. Why not write your messages to some distributed file system and schedule a job (MapReduce, Spark...) to process those events?
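As a rough illustration of that suggestion, the sketch below uses a local date-partitioned directory as a stand-in for a distributed file system; the /data/events/yyyy-MM-dd layout and plain-text event files are assumptions for the example:

    // Sketch of the file-system alternative: events are landed in a
    // date-partitioned directory, and a daily job picks up only the
    // partition that is due today. Local paths stand in for HDFS/S3.
    import java.io.IOException;
    import java.io.UncheckedIOException;
    import java.nio.file.*;
    import java.time.LocalDate;
    import java.util.stream.Stream;

    public class DailyEventJob {
        public static void main(String[] args) throws IOException {
            // e.g. /data/events/2024-06-01 -- the layout is an assumption
            Path todays = Paths.get("/data/events", LocalDate.now().toString());
            if (!Files.isDirectory(todays)) {
                System.out.println("no events due today: " + todays);
                return;
            }
            try (Stream<Path> files = Files.list(todays)) {
                files.forEach(DailyEventJob::process);
            }
        }

        private static void process(Path file) {
            try (Stream<String> lines = Files.lines(file)) {
                lines.forEach(event -> System.out.println("processing " + event));
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
        }
    }

In production the same shape would typically be a scheduled Spark or MapReduce job over HDFS/S3 date partitions rather than a local directory scan.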
