简体   繁体   English

如何为 java 的 apache 光束管道配置火花流道

[英]How to configure spark runner for apache beam pipeline for java

I'm new to Apache Beam and Apace spark, i want to configure the beam pipeline for spark runner.我是 Apache Beam 和 Apace spark 的新手,我想为 spark runner 配置光束管道。 what is mentioned in the documentation is not clear to me.我不清楚文档中提到的内容。

You can pass argument --runner SparkRunner to the pipeline launcher to use spark as the underlying runner.您可以将参数--runner SparkRunner给管道启动器,以使用 spark 作为底层运行器。 Also, Please share what language of Beam SDK you are using.另外,请分享您使用的 Beam SDK 的语言。 Python and java have some what different ways to run on Spark via Beam. Python 和 java 有一些通过 Beam 在 Spark 上运行的不同方式。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 如何使用java中的Apache Beam直达写入BigTable? - How to write to BigTable using Apache Beam direct-runner in java? 如何将 Apache Beam 直接运行器添加到类路径? - How to add Apache Beam direct runner to classpath? 在Apache Beam管道中结合Java和Python - Combining Java and Python in Apache Beam pipeline org.apache.beam.sdk.util.UserCodeException 使用 Samza Runner 执行 Beam Pipeline - org.apache.beam.sdk.util.UserCodeException while executing Beam Pipeline using the Samza Runner 如何处理 apache 光束管道中的异常? - How to handle exception in a apache beam pipeline? Apache Beam与数据流运行器中的聚合器 - Aggregators in Apache beam with dataflow runner JAVA - Apache BEAM- GCP:GroupByKey 在 Direct Runner 中运行良好,但在 Dataflow runner 中失败 - JAVA - Apache BEAM- GCP: GroupByKey works fine with Direct Runner but fails with Dataflow runner 在 Flink 运行器上通过 KafkaIO 消费事件时,Apache Beam Pipeline 失败 - Failing Apache Beam Pipeline when consuming events through KafkaIO on Flink runner Java Apache Beam 测试管道用空值替换测试数据 - Java Apache Beam Testing pipeline replaces test data with null values Java:使用 apache 光束管道读取存储在存储桶中的 excel 文件 - Java: read excel file stored in a bucket using apache beam pipeline
 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM