So I want to run a Python map reduce job on a Dataproc cluster, the problem is that I cannot find the Hadoop streaming jar file that needs to be submitted into the Main class or jar input.
I'm using Dataproc image with Hadoop 2.7, an answer for Hadoop 2.8 would also work.
Thanks
Hadoop Streaming jar located in /usr/lib/hadoop-mapreduce/
folder on Dataproc cluster nodes: /usr/lib/hadoop-mapreduce/hadoop-streaming.jar
The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.