简体   繁体   English

kafka - 可以将python程序连接到hadoop集群外的Kafka吗?

[英]kafka - can python program connect to Kafka outside hadoop cluster?

I would like to write a python program which reads a file and writes the data to Kafka as a producer. 我想编写一个python程序,它读取文件并将数据作为生产者写入Kafka。 Kafka is setup in the Hadoop cluster and Python will be running outside Hadoop cluster. Kafka在Hadoop集群中设置,Python将在Hadoop集群之外运行。

is it possible to connect from outside? 可以从外面连接吗? do i need to install Kafka libraries for python? 我需要为python安装Kafka库吗? how do I need to install? 我该如何安装?

Easiest way to learn more about Kafka is to use http://landoop.com/docs/lenses/developers 了解Kafka的最简单方法是使用http://landoop.com/docs/lenses/developers

You will need to run 1 docker - to bring up everything locally and then developer a Python app, using the appropriate Kafka libs that connect to Kafka via the Kafka APIs and produce messages to it 您将需要运行1个docker - 在本地调出所有内容,然后开发Python应用程序,使用通过Kafka API连接到Kafka的相应Kafka库并向其生成消息

Once you've built your app - you can then package it and run it against your your Hadoop's Kafka brokers 一旦你构建了你的应用程序 - 然后你就可以打包它并对你的Hadoop的Kafka经纪人进行打包

is it possible to connect from outside? 可以从外面连接吗?

Depends on the firewall rules, but yes, it is possible 取决于防火墙规则,但是,有可能

do i need to install Kafka libraries for python? 我需要为python安装Kafka库吗? how do I need to install? 我该如何安装?

Yes. 是。 There are several Python libraries for Kafka. Kafka有几个Python库。 pip install kafka-python , for example. 例如, pip install kafka-python

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM