
Why multiple MapReduce jobs for one Pig / Hive job?

I am using Pig to run my Hadoop job. When I run the Pig script and then navigate to the YARN Resource Manager UI, I can see multiple MapReduce jobs being created for the same Pig job. I believe it would be the same for Hive jobs as well.

Can anyone please explain the reasoning behind this? On what basis is one Pig job split into multiple MapReduce jobs? One of them happens to be TempletonControllerJob.

[Screenshot: YARN Resource Manager UI]

Thanks

TempletonControllerJob is a parent job that launches the actual child MapReduce job(s) and controls their execution. It appears when the job is submitted through the WebHCat (Templeton) REST API rather than directly from the command line.

Before executing, Pig builds an execution plan: it scans all the steps in the Pig script and combines steps that can be executed in a single MapReduce job. When two steps in the script cannot be computed in a single job (for example, each operation such as GROUP, JOIN, or ORDER BY typically needs its own reduce phase), it splits them into separate jobs. Once it has done this combining and worked out the number of jobs and the steps within each job needed to produce the final result, it starts the execution.
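As a hypothetical sketch (the file paths and field names below are made up for illustration), a script like this one typically compiles into two MapReduce jobs, because the GROUP and the ORDER BY each require their own shuffle/reduce phase. Running `EXPLAIN` on the final alias prints the MapReduce plan Pig has built, so you can see the split before anything runs:

```pig
-- Load some sample sales records (hypothetical path and schema)
sales = LOAD '/data/sales.csv' USING PigStorage(',')
        AS (store:chararray, amount:double);

-- Step 1: GROUP + SUM -> first MapReduce job (one shuffle on 'store')
by_store = GROUP sales BY store;
totals   = FOREACH by_store GENERATE group AS store,
                                     SUM(sales.amount) AS total;

-- Step 2: global ORDER BY -> second MapReduce job (another shuffle)
ranked = ORDER totals BY total DESC;

-- Print the MapReduce execution plan instead of running the script;
-- the output lists each MapReduce job and the operators assigned to it
EXPLAIN ranked;
```

If you replace `EXPLAIN ranked;` with `STORE ranked INTO '/out/ranked';` and run the script, you should see two MapReduce applications for it in the YARN Resource Manager UI (plus a TempletonControllerJob if it was submitted via WebHCat).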

