
Can an Apache Beam pipeline be used for batch orchestration?

I am new to Apache Beam and am trying to use an Apache Beam pipeline for batch orchestration.

My definition of a batch is as follows:

Batch ==> a set of jobs
Job ==> can have one or more sub-jobs

There can be dependencies between jobs and sub-jobs.

Can an Apache Beam pipeline be mapped to my custom batch?

Apache Beam is a unified model for developing both batch and streaming pipelines, which can be run on Dataflow. You can create and deploy your pipeline using Dataflow. Beam pipelines are portable, so you can use any of the available runners according to your requirements.

Cloud Composer can be used for the batch orchestration you describe. Cloud Composer is built on Apache Airflow. Apache Beam and Apache Airflow can be used together, since Airflow can trigger Beam jobs. Since you have custom jobs running, you can configure Beam and Airflow together for batch orchestration.

Airflow is meant for orchestration and pipeline dependency management, while Beam is used to build data pipelines that are executed on data processing systems.
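To make the "dependency management" part concrete, here is a rough sketch of the ordering problem an orchestrator solves for a batch of jobs and sub-jobs. It uses only the Python standard library (`graphlib`, Python 3.9+) and is not Airflow or Beam code. The job names and the `execution_waves` helper are made up for illustration.

```python
from graphlib import TopologicalSorter

# Hypothetical batch: each job or sub-job maps to the set of jobs it depends on.
batch = {
    "extract": set(),
    "transform.clean": {"extract"},
    "transform.enrich": {"extract"},
    "load": {"transform.clean", "transform.enrich"},
}


def execution_waves(dependencies):
    """Group jobs into waves; each job's dependencies all sit in earlier waves."""
    sorter = TopologicalSorter(dependencies)
    sorter.prepare()  # raises graphlib.CycleError if the dependencies are circular
    waves = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())  # jobs whose dependencies are all done
        waves.append(ready)
        sorter.done(*ready)  # mark the whole wave as finished
    return waves


waves = execution_waves(batch)
for number, wave in enumerate(waves, start=1):
    print(f"wave {number}: {', '.join(wave)}")
```

Jobs in the same wave do not depend on each other, so an orchestrator is free to run them in parallel. In Airflow, the same graph would be declared as task dependencies inside a DAG, with each task triggering a Beam job.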

I believe Composer might be better suited to what you're trying to build. From there, you can launch Dataflow jobs from your environment using Airflow operators (for example, if you're using Python, you can use the DataflowCreatePythonJobOperator).

