Want to run Apache Beam Pipeline in parallel
My problem statement is
As I am new to Beam, my question is
Beam's long-term plan is to support many different sources (eventually even cross-language ones).
To your questions: running multiple beam-runner-direct-java pipelines in parallel on a single machine won't cause problems. In fact, all the validation tests use the direct runner, and those tests do run in parallel.
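As a rough illustration of launching several independent jobs side by side in one JVM, here is a minimal sketch that uses only the Java standard library. The class name `ParallelPipelines`, the method `runAll`, and the source names are hypothetical, and the `job` function is a placeholder: in a real program it would build one Beam pipeline for the given source and run it on the direct runner. No Beam API is called here, so treat this as a shape for the launcher, not as Beam code.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.Function;

public class ParallelPipelines {

    // Runs one job per source concurrently and returns the results in source order.
    // `job` is a hypothetical stand-in for "build and run one pipeline for this source".
    public static List<String> runAll(List<String> sources, Function<String, String> job) {
        ExecutorService pool = Executors.newFixedThreadPool(Math.max(1, sources.size()));
        try {
            List<Future<String>> futures = new ArrayList<>();
            for (String source : sources) {
                Callable<String> task = () -> job.apply(source);
                futures.add(pool.submit(task));
            }
            List<String> results = new ArrayList<>();
            for (Future<String> future : futures) {
                try {
                    // Blocks until this job has finished.
                    results.add(future.get());
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                    throw new IllegalStateException("Interrupted while waiting for a job", e);
                } catch (ExecutionException e) {
                    throw new IllegalStateException("A job failed", e.getCause());
                }
            }
            return results;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) {
        // Hypothetical source names; the lambda is a placeholder for real pipeline code.
        List<String> sources = List.of("sourceA", "sourceB", "sourceC");
        List<String> results = runAll(sources, source -> "finished " + source);
        results.forEach(System.out::println);
    }
}
```

Each job gets its own thread, and the caller waits for all of them before returning. Whether this is preferable to a single pipeline is a separate question.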
One thing is unclear: what is the main reason you have to create multiple pipelines, one for each third-party source? If the reason is to run things in parallel for higher throughput, I think (biased opinion) that is not a good idea. In the long run, even if we introduce a feature that optimizes parallel sources, you won't be able to benefit from that optimization.