简体   繁体   English

在 Dataproc 的集群中查找 Hadoop 流式传输 jar

[英]Finding the Hadoop streaming jar in cluster in Dataproc

So I want to run a Python map reduce job on a Dataproc cluster, the problem is that I cannot find the Hadoop streaming jar file that needs to be submitted into the Main class or jar input. So I want to run a Python map reduce job on a Dataproc cluster, the problem is that I cannot find the Hadoop streaming jar file that needs to be submitted into the Main class or jar input.

I'm using Dataproc image with Hadoop 2.7, an answer for Hadoop 2.8 would also work.我正在使用带有 Hadoop 2.7 的 Dataproc 映像,Hadoop 2.8 的答案也可以。

Thanks谢谢

Hadoop Streaming jar located in /usr/lib/hadoop-mapreduce/ folder on Dataproc cluster nodes: /usr/lib/hadoop-mapreduce/hadoop-streaming.jar Hadoop 流 jar 位于 Dataproc 集群节点上的/usr/lib/hadoop-mapreduce/文件夹中: /usr/lib/hadoop-mapreduce/hadoop-streaming.jar

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM