简体   繁体   English

以分布式模式运行kafka connect?

[英]Running kafka connect in Distributed mode?

I have a total of 3 VM's(CloudVPS).我总共有 3 个虚拟机 (CloudVPS)。 Each of them has java, confluent open source installed on them.他们每个人都安装了 java、confluent 开源软件。 In VM1 I am running 3 processes of Splunk-sink-connector which reads from different topics and are running on different ports.在 VM1 中,我正在运行 3 个 Splunk-sink-connector 进程,它们从不同的主题读取并在不同的端口上运行。 And using REST calls I posted JSON configuration to each of them.我使用 REST 调用将 JSON 配置发布到它们中的每一个。

Since I am running in distributed mode I want to take advantage of other 2 VM's also.因为我在分布式模式下运行,所以我也想利用其他 2 个虚拟机。 Can anyone please tell me what to do, to add other 2 VM's to those 3 processes to achieve parallel processing.谁能告诉我该怎么做,将其他 2 个虚拟机添加到这 3 个进程以实现并行处理。

You just need to run Kafka Connect in Distributed mode on the three VMs, follow the instructions here and make sure you give them all the same group.id which identifies them as members of the same cluster (and thus eligible for sharing workload of tasks out across them).您只需要在三个虚拟机上以分布式模式运行 Kafka Connect, 按照此处的说明进行操作,并确保为它们提供相同的group.id ,将它们标识为同一集群的成员(因此有资格共享任务的工作负载)穿过它们)。 More config details for distributed mode here .分布式模式的更多配置细节在这里

See also:也可以看看:

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM