[英]Hadoop Streaming Job with no input file
Is it possible to execute a Hadoop Streaming job that has no input file? 是否可以执行没有输入文件的Hadoop Streaming作业?
In my use case, I'm able to generate the necessary records for the reducer with a single mapper and execution parameters. 在我的用例中,我能够使用单个映射器和执行参数为化简器生成必要的记录。 Currently, I'm using a stub input file with a single line, I'd like to remove this requirement.
目前,我正在使用单行存根输入文件,我想删除此要求。
We have 2 use cases in mind. 我们有2个用例。
1) 1)
According to the docs this is not possible. 根据文档,这是不可能的。 The following are required parameters for execution:
以下是执行所需的参数:
It looks like providing a dummy input file is the way to go currently. 看起来提供虚拟输入文件是当前的方法。
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.