
Reading Java properties files in Hadoop MapReduce applications

I was wondering what the standard practice is for reading Java properties files in MapReduce applications, and how to pass the file's location when submitting (starting) a job. In a regular Java application you can pass the location of the properties file as a JVM system property (-D) or as an argument to the main method. What is the best alternative (standard practice) to this for MapReduce jobs? Some good examples would be very helpful.
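For reference, a minimal sketch of the regular-Java baseline described above; the system property name `config.path` and the fallback file name are made up for illustration:

```java
import java.io.FileInputStream;
import java.util.Properties;

public class PlainJavaProps {
    public static void main(String[] args) throws Exception {
        // Location comes from -Dconfig.path=... or the first main() argument.
        String path = System.getProperty(
                "config.path", args.length > 0 ? args[0] : "app.properties");
        Properties props = new Properties();
        try (FileInputStream in = new FileInputStream(path)) {
            props.load(in);
        }
        System.out.println(props);
    }
}
```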

The best alternative is to use DistributedCache, though it may not be *the* standard way. There may be other ways, but I haven't seen any code that uses anything else so far.

The idea is to add the file to the cache when the job is submitted, read it inside the setup method of the map/reduce task, and load the values into a Properties object or a Map. If you need a snippet, I can add one.
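Here is a minimal sketch of that approach. It uses Job#addCacheFile, which replaced the deprecated DistributedCache static methods in Hadoop 2.x; the HDFS path and the property key `output.prefix` are placeholders:

```java
import java.io.FileInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.net.URI;
import java.util.Properties;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class CachedPropertiesJob {

    public static class PropsMapper
            extends Mapper<LongWritable, Text, Text, Text> {

        private final Properties props = new Properties();

        @Override
        protected void setup(Context context)
                throws IOException, InterruptedException {
            // The "#app.properties" fragment used when the file was cached
            // creates a symlink with that name in the task's working
            // directory, so the file can be opened like a local file.
            try (InputStream in = new FileInputStream("app.properties")) {
                props.load(in);
            }
        }

        @Override
        protected void map(LongWritable key, Text value, Context context)
                throws IOException, InterruptedException {
            // "output.prefix" is a made-up property key for illustration.
            String prefix = props.getProperty("output.prefix", "");
            context.write(new Text(prefix), value);
        }
    }

    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        Job job = Job.getInstance(conf, "cached-properties");
        job.setJarByClass(CachedPropertiesJob.class);
        job.setMapperClass(PropsMapper.class);
        job.setNumReduceTasks(0);
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(Text.class);
        // The properties file must already be on HDFS; this path is a placeholder.
        job.addCacheFile(new URI("/user/me/app.properties#app.properties"));
        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
```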

Oh, I remember now: my friend JtheRocker used another approach. He set the entire contents of the file against a key in the Configuration object, read that value back in setup, then parsed and loaded the pairs into a Map. In this case the file is read on the driver side, whereas in the previous approach it is read on the task side. While this is suitable for small files and looks cleaner, purists may not like polluting the conf at all. A sketch follows below.
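A minimal sketch of this second approach, assuming the properties file is available on the driver's local filesystem; the conf key `my.app.properties` and the property key `greeting` are arbitrary names chosen for this sketch:

```java
import java.io.IOException;
import java.io.StringReader;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.Properties;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;

public class ConfPropertiesJob {

    // Arbitrary configuration key chosen for this sketch.
    private static final String PROPS_KEY = "my.app.properties";

    public static class PropsMapper
            extends Mapper<LongWritable, Text, Text, Text> {

        private final Properties props = new Properties();

        @Override
        protected void setup(Context context) throws IOException {
            // Recover the file's contents from the Configuration and parse
            // them back into a Properties object (java.util.Properties can
            // load key=value pairs straight from a Reader).
            String raw = context.getConfiguration().get(PROPS_KEY, "");
            props.load(new StringReader(raw));
        }

        @Override
        protected void map(LongWritable key, Text value, Context context)
                throws IOException, InterruptedException {
            // Example use of a loaded property; "greeting" is made up.
            context.write(new Text(props.getProperty("greeting", "")), value);
        }
    }

    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // Driver side: read the whole file (a local path here) and stash it
        // in the conf *before* the Job copies the Configuration.
        String contents = new String(
                Files.readAllBytes(Paths.get(args[0])), StandardCharsets.UTF_8);
        conf.set(PROPS_KEY, contents);

        Job job = Job.getInstance(conf, "conf-properties");
        job.setJarByClass(ConfPropertiesJob.class);
        job.setMapperClass(PropsMapper.class);
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(Text.class);
        // ... set input/output formats and paths as in any other job ...
        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
```

Since Configuration values are shipped to every task anyway, this avoids a separate cache-file read per task, but stuffing very large values into the conf bloats every task's job configuration, which is why it suits small files only.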

I would like to see what other answers bring up.
