简体   繁体   English

Python hadoop streaming:设置一个工作名称

[英]Python hadoop streaming : Setting a job name

  • I have a job that runs on my cluster using hadoop-streaming 我有一个使用hadoop-streaming在我的集群上运行的工作
  • I have to start a new job for which I want to add a job name, how can I pass that option on command-line or file to setup a job name? 我必须开始一个我想要添加作业名称的新工作,如何在命令行或文件中传递该选项以设置作业名称?
  • In Java, you can do this by saying 在Java中,您可以这样说
 JobConf conf = new JobConf(WordCount.class); conf.setJobName("wordcount"); 

How can I do that with hadoop-streaming? 我怎么能用hadoop-streaming做到这一点?

Configure the property using the -D key=value notation: 使用-D key = value表示法配置属性:

-D mapred.job.name='My Job'

You can list a whole bunch of options by calling the streaming jar with just the -info argument 您可以通过仅使用-info参数调用流jar来列出一大堆选项

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM