简体繁体中英

How to copy files from HDFS to S3 effectively programatically

原文 2010-09-14 18:09:37 1 1 amazon-s3/ hadoop/ hdfs

My hadoop job generate large number of files on HDFS and I want to write a separate thread which will copy these files from HDFS to S3.

Could any one point me to any java API that handles it.

Thanks

1 answers

"Support for the S3 block filesystem was added to the ${HADOOP_HOME}/bin/hadoop distcp tool in Hadoop 0.11.0 (See HADOOP-862). The distcp tool sets up a MapReduce job to run the copy. Using distcp, a cluster of many members can copy lots of data quickly. The number of map tasks is calculated by counting the number of files in the source: ie each map task is responsible for the copying one file. Source and target may refer to disparate filesystem types. For example, source might refer to the local filesystem or hdfs with S3 as the target. "

Check out Running Bulk Copies in and out of S3 here http://wiki.apache.org/hadoop/AmazonS3

Copy and extract files from s3 to HDFS

How to get files from HDFS to S3

incrementally copy files from S3 to local hdfs

Can distcp be used to copy a directory of files from S3 to HDFS?

How do I copy files from S3 to Amazon EMR HDFS?

Copy and unzip from S3 to HDFS

Copy from S3 TO HDFS Using Spark

How to upload large files from HDFS to S3

Copy files from S3 to HDFS using distcp or s3distcp

Copy files from HDFS to Amazon S3 using distp and s3a scheme

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question Copy and extract files from s3 to HDFS How to get files from HDFS to S3 incrementally copy files from S3 to local hdfs Can distcp be used to copy a directory of files from S3 to HDFS? How do I copy files from S3 to Amazon EMR HDFS? Copy and unzip from S3 to HDFS Copy from S3 TO HDFS Using Spark How to upload large files from HDFS to S3 Copy files from S3 to HDFS using distcp or s3distcp Copy files from HDFS to Amazon S3 using distp and s3a scheme

Related Tags

How to copy files from HDFS to S3 effectively programatically

Question

1 answers

solution1 9 ACCPTED 2010-09-16 02:30:43

solution1
9 ACCPTED 2010-09-16 02:30:43