
How does mapper output get written to HDFS in case of Sqoop?

From what I have learned about Hadoop MapReduce jobs, mapper output is written to local storage and not to HDFS, since it is ultimately throwaway data and there is no point in storing it in HDFS.

But in the case of Sqoop I can see that the mapper output file part-m-00000 is written into HDFS. So my question is: is there some setting in Hadoop that controls where mapper output gets written, and is it set to local storage by default?

If there are no reducers, the mapper output is the job output and is written to HDFS. Even then it does not land in its final location straight away: each map task writes through the job's OutputFormat into a temporary task-attempt directory under the output path, and the output committer moves it to the final part-m-NNNNN file when the task succeeds.
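
For illustration, here is a minimal sketch of a map-only job in Java (the class names, pass-through mapper, and HDFS paths are hypothetical). The relevant "setting" is the number of reduce tasks: setting it to zero makes the map output go through the job's OutputFormat to the configured output directory on HDFS as part-m-NNNNN files, instead of being treated as intermediate shuffle data on local disk.

import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.NullWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

// Map-only job: with zero reduce tasks, each map task's output is handed to the
// job's OutputFormat and committed to the output directory (typically on HDFS)
// as part-m-00000, part-m-00001, ... -- the same naming seen with Sqoop imports.
public class MapOnlyJob {

    // Hypothetical pass-through mapper: emits each input line unchanged.
    public static class PassThroughMapper
            extends Mapper<LongWritable, Text, NullWritable, Text> {
        @Override
        protected void map(LongWritable key, Text value, Context context)
                throws IOException, InterruptedException {
            context.write(NullWritable.get(), value);
        }
    }

    public static void main(String[] args) throws Exception {
        Job job = Job.getInstance(new Configuration(), "map-only-example");
        job.setJarByClass(MapOnlyJob.class);
        job.setMapperClass(PassThroughMapper.class);

        // The relevant "setting": zero reduce tasks makes this a map-only job,
        // so map output is written via the OutputFormat to the output path
        // below instead of being spilled to local disk as shuffle data.
        job.setNumReduceTasks(0);

        job.setOutputKeyClass(NullWritable.class);
        job.setOutputValueClass(Text.class);

        // Hypothetical HDFS paths.
        FileInputFormat.addInputPath(job, new Path("/user/example/input"));
        FileOutputFormat.setOutputPath(job, new Path("/user/example/output"));

        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}

Sqoop imports are built on this same map-only pattern, which is why the files they leave in the target directory carry the part-m prefix.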

Sqoop is one such scenario: an import is typically a map-only job in which you want to pull data from a table in parallel, but there is no need to reduce (aggregate) the data on any key.

Check this link: Identity Reducer vs zero reducer
