
Using Apache Flume to write logs from a MapReduce job into HDFS

I am trying to write logs from a MapReduce job into HDFS, using Apache Flume NG.

My environment:

  • Java 6
  • Log4j 1.2.16
  • Apache Hadoop 2.3.0
  • Apache Flume 1.4.0

Problem #1

I have created a simple MapReduce job as a Maven project and I use logger.info() in my classes. When my job completes I can see my log messages in the task's syslog file.

I would like to create my own log4j configuration and also write the logs to the console. How can I do this? Where do I have to put the log4j.properties file? Should I modify the global Hadoop conf/log4j.properties?

Problem #2

I would like to write the logs to HDFS, but I don't want to tail -f the syslog file and ship its whole content. I only want the logs from my own classes - the messages produced by logger.info().

Is this possible with Apache Flume NG? Or is there an easier way to do this?

My idea was to configure a Flume Log4j Appender in log4j.properties (pointing at, for example, localhost, port 44444). In the Flume NG configuration I would bind an Avro source to the same address and write the logs to HDFS through a memory channel.
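Roughly, I was thinking of something like this (just a sketch; the agent name, HDFS path and port are placeholders):

    # log4j.properties (in my job) - send my log events to the Flume Avro source
    log4j.rootLogger = INFO, flume
    log4j.appender.flume = org.apache.flume.clients.log4jappender.Log4jAppender
    log4j.appender.flume.Hostname = localhost
    log4j.appender.flume.Port = 44444

    # flume.conf - Avro source -> memory channel -> HDFS sink
    agent1.sources = avro-source
    agent1.channels = mem-channel
    agent1.sinks = hdfs-sink

    agent1.sources.avro-source.type = avro
    agent1.sources.avro-source.bind = 0.0.0.0
    agent1.sources.avro-source.port = 44444
    agent1.sources.avro-source.channels = mem-channel

    agent1.channels.mem-channel.type = memory
    agent1.channels.mem-channel.capacity = 10000

    agent1.sinks.hdfs-sink.type = hdfs
    agent1.sinks.hdfs-sink.channel = mem-channel
    agent1.sinks.hdfs-sink.hdfs.path = hdfs://namenode:8020/flume/mr-logs/%Y-%m-%d
    agent1.sinks.hdfs-sink.hdfs.fileType = DataStream
    agent1.sinks.hdfs-sink.hdfs.writeFormat = Text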

Is this a good solution?

Problem #1

Which console? Remember that the map and reduce tasks run in separate JVMs, usually on different nodes, so there is no single console. If you only want the logs from the driver, that is a simple log4j configuration.
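For the driver, a minimal log4j.properties on the driver's classpath is enough, something along these lines (the appender name and pattern are just an illustration):

    # log4j.properties - assumed to be on the driver's classpath
    log4j.rootLogger = INFO, console

    # Write log events to the driver's console (stderr)
    log4j.appender.console = org.apache.log4j.ConsoleAppender
    log4j.appender.console.Target = System.err
    log4j.appender.console.layout = org.apache.log4j.PatternLayout
    log4j.appender.console.layout.ConversionPattern = %d{ISO8601} [%t] %-5p %c - %m%n

The task JVMs, however, keep using the log4j configuration that the NodeManagers ship with the job, which is why your messages end up in each task's syslog file.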

Problem #2

What you are attempting is generally a good solution. A Flume appender is available in the Log4j project: the Log4J 2 Flume Appender.

See http://logging.apache.org/log4j/2.x/manual/appenders.html#FlumeAppender. The other option is the Kite SDK.
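For illustration, the Log4J 2 Flume Appender is configured in log4j2.xml roughly like this (host, port and pattern are just example values; if no layout is given the appender falls back to an RFC5424 layout - see the manual linked above for all options):

    <?xml version="1.0" encoding="UTF-8"?>
    <Configuration status="warn">
      <Appenders>
        <!-- Ships log events as Avro to a Flume agent -->
        <Flume name="eventLogger" compress="false">
          <Agent host="localhost" port="44444"/>
          <PatternLayout pattern="%d [%t] %-5p %c - %m%n"/>
        </Flume>
      </Appenders>
      <Loggers>
        <Root level="info">
          <AppenderRef ref="eventLogger"/>
        </Root>
      </Loggers>
    </Configuration>

Note that this means moving from Log4j 1.2 to Log4j 2; if you stay on Log4j 1.x, the equivalent is the Log4jAppender shipped with Flume itself, as in your sketch.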
