简体   繁体   中英

How to run EMR steps first sequentially and then in parallel?

在此处输入图像描述

This process can be achieved easily with the help of orchestration engine like Step Function but for some reason we cannot use. Here, I want to understand can we do some kind of code manipulation to run parallel execution after step2 where step 3 and step4 will run in parallel and after step 4 it will run step 5 in sequence.

Any reference how can I achieve this or anyone does so far? Thanks

As you said you can't go with Step Function for some reason, you can try Oozie.

Oozie is a workflow scheduler system to manage Apache Hadoop jobs. Its also a EMR managed service, one click to install. Oozie jobs are Directed Acyclical Graphs (DAGs).

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM