How to consume messages from kafka producer in batches (kafka-python)

Question

I have a Kafka producer and consumer in Python. I wish to consume messages from the Kafka producer in batches of, let's say, 2. From the producer, I have been sending email data like the following:

[{
    "email" : "sukhi215c@gmail.com",
    "subject": "Test 1",
    "message" : "this is a test"
},
{
    "email" : "sukhi215c@gmail.com",
    "subject": "Test 2",
    "message" : "this is a test"   
},
{
    "email" : "sukhi215c@gmail.com",
    "subject": "Test 3",
    "message" : "this is a test"   
},
{
    "email" : "sukhi215c@gmail.com",
    "subject": "Test 4",
    "message" : "this is a test"   
}]

I am trying to consume this data in batches. I want to consume 2 messages at a time, send emails based on those 2 messages, and then consume the next set. The workaround that I tried is:

consumer = KafkaConsumer(topic, bootstrap_servers=[server], api_version=(0, 10))
for message in consumer[:2]:
    string = message.value.decode("utf-8")
    dict_value = ast.literal_eval(string)

The error that I am getting is:

    for message in consumer[:2]:
TypeError: 'KafkaConsumer' object is not subscriptable

Can someone help me get past this?

Answer 1

The consumer is not a collection, so it cannot be sliced; its iterator is infinite.

If you want to perform an action every two events, use a counter or your own list:

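from kafka import KafkaConsumer

data = []
consumer = KafkaConsumer(topic, bootstrap_servers=[server], api_version=(0, 10))
for message in consumer:
    data.append(message)
    if len(data) >= 2:
        action(data)  # process the batch of two, e.g. send the emails
        data.clear()  # start collecting the next batch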
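
The same idea can be wrapped in a small generator so the batching logic is separate from the Kafka wiring. This is only a sketch: the name batches is hypothetical, and it assumes you also want any leftover messages when the iterator ends (which only happens with a real consumer if you set consumer_timeout_ms).

```python
def batches(messages, size=2):
    """Group items from any iterable into lists of at most `size` items."""
    batch = []
    for message in messages:
        batch.append(message)
        if len(batch) == size:
            yield batch
            batch = []  # new list, so the batch already yielded is not mutated
    if batch:
        # Leftover items; only reached if the iterable is finite.
        yield batch


# Hypothetical usage with the consumer and action() from the answer above:
# for batch in batches(consumer, size=2):
#     action(batch)
```

Because the generator accepts any iterable, it can be tried out on a plain list before pointing it at a consumer.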

Answer 2

Use the poll() interface documented here:

https://kafka-python.readthedocs.io/en/master/_modules/kafka/consumer/group.html#KafkaConsumer.poll

poll() takes a timeout_ms argument, so it returns early if there are no messages to consume, and a max_records argument that caps how many records a single call returns.
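
A minimal sketch of what that could look like, assuming a batch size of 2 and a one-second timeout. The helper names poll_batch and run are hypothetical; poll() itself returns a dict mapping each topic partition to a list of records, so the sketch flattens that into one list. Note that a batch may hold fewer than batch_size records if the timeout expires first.

```python
def poll_batch(consumer, batch_size=2, timeout_ms=1000):
    """Return up to `batch_size` records from a single poll() call."""
    # poll() returns {TopicPartition: [records, ...]}; max_records caps the total.
    records_by_partition = consumer.poll(timeout_ms=timeout_ms, max_records=batch_size)
    batch = []
    for records in records_by_partition.values():
        batch.extend(records)
    return batch


def run(topic, server, action):
    """Hypothetical wiring: poll forever and hand each non-empty batch to action()."""
    from kafka import KafkaConsumer  # imported here so poll_batch works without kafka-python

    consumer = KafkaConsumer(topic, bootstrap_servers=[server])
    while True:
        batch = poll_batch(consumer)
        if batch:  # an empty list means the timeout expired with no messages
            action(batch)
```

If exactly two messages per action are required, the records returned by successive polls would need to be accumulated until two are available.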
