Finding the Hadoop streaming jar on a Dataproc cluster

I want to run a Python MapReduce job on a Dataproc cluster. The problem is that I cannot find the Hadoop streaming jar file that has to be supplied in the "Main class or jar" field when submitting the job.

I'm using a Dataproc image with Hadoop 2.7; an answer for Hadoop 2.8 would also work.

Thanks

The Hadoop streaming jar is located in the /usr/lib/hadoop-mapreduce/ directory on the Dataproc cluster nodes: /usr/lib/hadoop-mapreduce/hadoop-streaming.jar
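
As a rough illustration of how that path might be used, here is a minimal Python sketch that only builds and prints a `gcloud dataproc jobs submit hadoop` command line; it does not run anything. The cluster name, region, bucket, and the `mapper.py` / `reducer.py` script names are hypothetical placeholders, and the `file://` prefix assumes the jar should be read from the cluster's local filesystem rather than from Cloud Storage. Whether `-mapper mapper.py` works as-is also depends on the scripts being executable with a suitable shebang line, which is an assumption here.

```python
import shlex

# Path quoted in the answer above. The file:// scheme is an assumption that
# marks the jar as local to the cluster nodes.
STREAMING_JAR = "file:///usr/lib/hadoop-mapreduce/hadoop-streaming.jar"


def build_streaming_command(cluster, region, bucket):
    """Return the argv list for a hypothetical streaming job submission."""
    return [
        "gcloud", "dataproc", "jobs", "submit", "hadoop",
        f"--cluster={cluster}",
        f"--region={region}",
        f"--jar={STREAMING_JAR}",
        "--",  # arguments after this separator are passed to the streaming jar
        "-files", f"gs://{bucket}/mapper.py,gs://{bucket}/reducer.py",
        "-mapper", "mapper.py",    # hypothetical script name
        "-reducer", "reducer.py",  # hypothetical script name
        "-input", f"gs://{bucket}/input/",
        "-output", f"gs://{bucket}/output/",
    ]


if __name__ == "__main__":
    # Placeholder values; the command is printed, not executed.
    argv = build_streaming_command("my-cluster", "us-central1", "my-bucket")
    print(shlex.join(argv))
```

The printed command can be reviewed and adjusted before it is pasted into a shell. When submitting through the web console instead, the same jar path would go into the "Main class or jar" field mentioned in the question.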
