[英]Limit concurrent executions on AWS Data Pipeline
Is there a way to limit concurrent execution on an AWS Data Pipeline? 有没有办法限制AWS Data Pipeline上的并发执行? We need to limit simultaneous executions to 1.
我们需要将同时执行限制为1。
Something similar to what Oozie has with the <concurrency>
property? 类似于Oozie的
<concurrency>
属性吗?
From the oozie docs: 来自oozie docs:
concurrency: The maximum number of actions for this job that can be running at the same time. 并发:可以同时运行此作业的最大操作数。 This value allows to materialize and submit multiple instances of the coordinator app, and allows operations to catchup on delayed processing.
此值允许实现并提交协调器应用程序的多个实例,并允许操作赶上延迟的处理。 The default value is 1 .
默认值为1。
You can use maxActiveInstances field under EC2Resource / EmrCluster to achieve this. 您可以使用EC2Resource / EmrCluster下的maxActiveInstances字段来实现此目的。
References - 参考-
https://docs.aws.amazon.com/datapipeline/latest/DeveloperGuide/dp-object-emrcluster.html https://docs.aws.amazon.com/datapipeline/latest/DeveloperGuide/dp-object-emrcluster.html
https://docs.aws.amazon.com/datapipeline/latest/DeveloperGuide/dp-object-ec2resource.html https://docs.aws.amazon.com/datapipeline/latest/DeveloperGuide/dp-object-ec2resource.html
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.