[英]Finding the Hadoop streaming jar in cluster in Dataproc
So I want to run a Python map reduce job on a Dataproc cluster, the problem is that I cannot find the Hadoop streaming jar file that needs to be submitted into the Main class or jar input. So I want to run a Python map reduce job on a Dataproc cluster, the problem is that I cannot find the Hadoop streaming jar file that needs to be submitted into the Main class or jar input.
I'm using Dataproc image with Hadoop 2.7, an answer for Hadoop 2.8 would also work.我正在使用带有 Hadoop 2.7 的 Dataproc 映像,Hadoop 2.8 的答案也可以。
Thanks谢谢
Hadoop Streaming jar located in /usr/lib/hadoop-mapreduce/
folder on Dataproc cluster nodes: /usr/lib/hadoop-mapreduce/hadoop-streaming.jar
Hadoop 流 jar 位于 Dataproc 集群节点上的
/usr/lib/hadoop-mapreduce/
文件夹中: /usr/lib/hadoop-mapreduce/hadoop-streaming.jar
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.