简体   繁体   English

从Internet直接将文件下载到我的S3存储桶

[英]Download a file from the Internet directly to my S3 bucket

I'm working with EMR (Elastic MapReduce) on AWS infrastructure and the default way to provide input files (large datasets) for programs is to upload them to an S3 bucket and reference those buckets from within EMR. 我正在使用AWS基础设施上的EMR(Elastic MapReduce),为程序提供输入文件(大型数据集)的默认方法是将它们上传到S3存储桶,并从EMR中引用这些存储桶。

Usually I download the datasets to my local,development machine and then upload them to S3, but this is getting harder to do with larger files, as upload speeds are generally much lower than download speeds. 通常,我将数据集下载到本地开发机器上,然后将其上传到S3,但这对大文件而言变得越来越困难,因为上传速度通常远低于下载速度。

My question is is there a way to download files from the internet (given their URL) directly into S3 so I don't have to download them to my local machine and then manually upload them? 我的问题是 ,有没有一种方法可以将文件从Internet( 直接提供给他们的URL) 直接下载到S3中,所以我不必将它们下载到本地计算机上然后手动上传它们?

不需要。您需要一个中介,通常是使用EC2实例而不是本地计算机来提高速度。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM