简体   繁体   中英

Does Spark read data from Kafka partition into executor, for a batch which is queued?

During spark streaming with streaming-kafka-0-8-integration Direct Approach, If the batches are getting queued, will the executors pull the data for queued batches into their memory? If not, what is the harm in having a very long backlog of batches?

是的,Spark将从Kafka Queue中提取数据并在内存上进行处理,并且由于Kafka的批次积压很长,因此危害将是对Kafka资源的压力。

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM