简体繁体中英

What does Hadoop gives to reducers?

原文 2016-02-06 22:51:34 0 1 java/ hadoop/ mapreduce/ distributed/ distributed-computing

After experimenting with 2 reducers , reading the HowManyMapsAndReduces from Hadoop Wiki, hadoop: number of reducers remains a constant 4 , Hadoop: Number of mappers and reducers and Setting the number of map tasks and reduce tasks I am driven in the conclusion that:

If I have 1 map (I understand that the number gets actually decided by Hadoop) and 2 reducers (where I actually provided only 1 file with the reducer code, eg -reducer /bin/wc ), then what will happen from the following?

Hadoop will distribute the data the mapper sends to both reducers (eg given 1000 lines of text, it will give ~500 to 1st reducer and ~500 to 2nd reducer)?
Hadoop will give all the data the mapper sends to both reducers (eg given 1000 lines of text, it will give 1000 to 1st reducer and 1000 to 2nd reducer)?

I think the 1st option, but I could not find evidence while searching the net.

1 answers

Option 1a: Hadoop will distribute data to the reducers, but it may not evenly divide it. There is no guarantee of balancing, especially if (1) your key distribution is skewed or (2) there are not a lot of records.

No of reducers in mapreduce hadoop

Hadoop Cannot set Reducers > 1

Parallelizing Ruby reducers in Hadoop?

Hadoop: Change number of reducers at runtime

Same key different Reducers (HADOOP)?

Hadoop reducers receiving wrong data

Hadoop - @Override error for mappers and reducers

Do Mappers and Reducers in Hadoop have to be static classes?

How to find time spent by mappers and reducers in Hadoop?

Hadoop - Sharing data between reducers through sockets

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question No of reducers in mapreduce hadoop Hadoop Cannot set Reducers > 1 Parallelizing Ruby reducers in Hadoop? Hadoop: Change number of reducers at runtime Same key different Reducers (HADOOP)? Hadoop reducers receiving wrong data Hadoop - @Override error for mappers and reducers Do Mappers and Reducers in Hadoop have to be static classes? How to find time spent by mappers and reducers in Hadoop? Hadoop - Sharing data between reducers through sockets

Related Tags

What does Hadoop gives to reducers?

Question

1 answers

solution1 4 ACCPTED 2016-02-06 23:03:42

solution1
4 ACCPTED 2016-02-06 23:03:42