简体   繁体   English

如何安装Apache spark并使其与Kafka一起运行?

[英]How do I install Apache spark and get it up and running with Kafka?

I am quite new to Hadoop and Apache Spark. 我是Hadoop和Apache Spark的新手。 I am a beginner trying my hands on it. 我是一个初学者,请尝试一下。 Now, I am trying to try my hands on Apache Spark. 现在,我尝试尝试Apache Spark。 In order to do that, I am assuming I have to install a software named Apache Spark on my machine. 为此,我假设必须在计算机上安装名为Apache Spark的软件。

I tried to create a local machine using VM but I am lost at this point. 我试图使用VM创建本地计算机,但是此时我迷路了。 Is there any resource to help me configure and install Spark and Kafka in the same machine ? 是否有任何资源可以帮助我在同一台计算机上配置和安装Spark和Kafka?

You are in luck, Chris Fregley (from the IBM Spark TC) has a project which has docker images for all of these things working together (you can see it at https://github.com/fluxcapacitor/pipeline/wiki ). 幸运的是,Chris Fregley(来自IBM Spark TC)有一个项目,其中包含用于所有这些功能的docker映像,可以一起工作(您可以在https://github.com/fluxcapacitor/pipeline/wiki上看到它)。 For a "real" production deployment, you might want to look at deploying Spark on YARN or something similar - its deployment options are explained at http://spark.apache.org/docs/latest/cluster-overview.html and integrating it with Kafka is covered in the special Kafka integration guide http://spark.apache.org/docs/latest/streaming-kafka-integration.html . 对于“真正的”生产部署,您可能需要查看在YARN或类似设备上部署Spark-其部署选项在http://spark.apache.org/docs/latest/cluster-overview.html中进行了说明并进行了集成。特殊的Kafka集成指南http://spark.apache.org/docs/latest/streaming-kafka-integration.html中介绍了使用Kafka的问题。 Welcome to the wonderful of Spark I hope these help you get started :) 欢迎来到精彩的Spark,希望这些对您有所帮助:)

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 如何获得在 Apache2 中运行的 Python cgi 脚本 - How do I get a Python cgi script running in Apache2 如何在 Spark Streaming + Kafka 中摆脱 NoSuchMethodError: org.apache.kafka.clients.consumer.KafkaConsumer.subscribe 错误 - How to get rid of NoSuchMethodError: org.apache.kafka.clients.consumer.KafkaConsumer.subscribe error in Spark Streaming + Kafka 如何在运行 Big Sur 11.0.1 的 Mac 上安装 Apache Airflow? - How do I install Apache Airflow on a Mac running Big Sur 11.0.1? 如何在Apache Spark中安装pyspark.pandas? - How to install pyspark.pandas in Apache Spark? 如何在Spark中为python 3.5安装numpy和pandas? - How do I install numpy and pandas for Python 3.5 in Spark? 如何设置CMakeLists.txt文件以启动并运行xtensor-python示例代码 - How do I setup a CMakeLists.txt file to get xtensor-python sample code up and running 如何仅通过命令行启动服务器并在虚拟机中运行? - How do I get a server up and running in a virtual machine through only command line? pymedia.audio.sound-如何启动并运行此模块? - pymedia.audio.sound - How do I get up and running with this module? Ubuntu通过apt-get安装apache spark - Ubuntu install apache spark via apt-get 如何在运行地图之前让Apache Spark工作者导入库 - How can I make Apache Spark workers to import libraries before running map
 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM