
How do I specify an S3 bucket as my input to EMR?

Original post: 2013-08-13 17:49:16 4 1 hadoop / amazon-s3 / elastic-map-reduce

Instead of copying over to HDFS, is it possible to just get an array of objects in a bucket in S3 to be processed in EMR?

I've tried this, and I keep hitting one of two problems. Either I get security warnings about missing credentials, even after adding them to the configs (this happens just from calling new Path("s3n://...")), or, when I use the AWS SDK to access my bucket, running the jar tells me the AWS SDK is missing.

1 answer

You can pass the S3 locations in the step's Arguments section.

When adding the job as a step, select Custom JAR:

JAR location: s3://inbsightshadoop/jar/loganalysis.jar
Main class: None
Arguments: s3://inbsightshadoop/insights-input s3://inbsightshadoop/insights-output
Action on failure: Terminate cluster
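
For what it's worth, the two S3 locations in the Arguments field reach the JAR's main method as ordinary command-line arguments. Below is a minimal sketch, not the code inside loganalysis.jar: the class name StepArguments and its helper methods are hypothetical, and it only uses the Java standard library to show the argument handling. In an actual Hadoop driver the same two strings would typically be handed to FileInputFormat.addInputPath and FileOutputFormat.setOutputPath, as noted in the comment.

```java
import java.net.URI;

/** Hypothetical helper: reads the two S3 locations an EMR Custom JAR step passes to main(). */
final class StepArguments {
    final URI input;
    final URI output;

    private StepArguments(URI input, URI output) {
        this.input = input;
        this.output = output;
    }

    /** Expects exactly two arguments: the input location, then the output location. */
    static StepArguments parse(String[] args) {
        if (args.length != 2) {
            throw new IllegalArgumentException("usage: <s3-input> <s3-output>");
        }
        return new StepArguments(requireS3(args[0]), requireS3(args[1]));
    }

    /** Accepts s3://, s3n:// and s3a:// URIs that name a bucket. */
    private static URI requireS3(String value) {
        URI uri = URI.create(value);
        String scheme = uri.getScheme();
        boolean s3Scheme = "s3".equals(scheme) || "s3n".equals(scheme) || "s3a".equals(scheme);
        if (!s3Scheme || uri.getAuthority() == null) {
            throw new IllegalArgumentException("not an S3 location: " + value);
        }
        return uri;
    }

    public static void main(String[] args) {
        StepArguments step = parse(args);
        // In a real Hadoop driver these two values would usually go to
        // FileInputFormat.addInputPath(job, new Path(args[0])) and
        // FileOutputFormat.setOutputPath(job, new Path(args[1])).
        System.out.println("input bucket:  " + step.input.getAuthority()
                + ", key prefix: " + step.input.getPath());
        System.out.println("output bucket: " + step.output.getAuthority()
                + ", key prefix: " + step.output.getPath());
    }
}
```

With the arguments from the step above, args[0] would be s3://inbsightshadoop/insights-input and args[1] would be s3://inbsightshadoop/insights-output, so nothing has to be copied to HDFS first.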


The technical posts on this site are licensed under CC BY-SA 4.0. If you reprint them, please credit the site URL or the original address. For any questions, please contact: yoyou2525@163.com.


solution1 0 2014-08-21 07:31:59
