
AWS CLI S3 CP performance is painfully slow

I've got an issue whereby uploads to and downloads from AWS S3 via the aws cli are very slow. By very slow I mean it consistently takes around 2.3s for a 211k file, which indicates an average download speed of less than 500Kb/s; that is extremely slow for such a small file. My webapp is heavily reliant on internal APIs, and I've narrowed down that the bulk of the API's round-trip time is predominantly spent uploading and downloading files from S3.

Some details:

  • Using the latest version of the aws cli (aws-cli/1.14.44 Python/3.6.6, Linux/4.15.0-34-generic botocore/1.8.48) on an AWS-hosted EC2 instance
  • Instance is running the latest version of Ubuntu (18.04)
  • Instance is in ap-southeast-2a (Sydney)
  • Instance is granted role-based access to S3 via a least-privilege policy (ie minimum rights to the buckets it needs access to)
  • Type is t2.micro, which should have Internet bandwidth of ~60Mb or so
  • S3 buckets are in ap-southeast-2
  • Same result with encrypted (default) and unencrypted files
  • Same result regardless of whether the object name contains a random collection of alphanumeric characters
  • The issue persists consistently: even after multiple cp attempts, and after a reboot, the cp attempt still takes 2.3s
  • This leads me to wonder whether S3 or the EC2 instance (which is using a standard Internet Gateway) is throttled back
  • I've tested downloading the same file to the same instance from a webserver using wget, and it takes 0.0008s (ie 0.8ms)

So to summarise:

  • Downloading the file from S3 via the AWS CLI takes 2.3s (ie 2300ms)
  • Downloading the same file from a webserver (> Internet > Cloudflare > AWS > LB > Apache) via wget takes 0.0008s (ie 0.8ms)

I need to improve AWS CLI S3 download performance, because the API is going to be quite heavily used in the future.

Any suggestions would be gratefully received.

Okay, this was a combination of things.

I'd had problems with the AWS PHP SDK previously (mainly related to orphaned threads when copying files), so I had changed my APIs to use the AWS CLI for simplicity and reliability. Although they worked, I encountered a few performance issues:

  • Firstly, because my instance had role-based access to my S3 buckets, the aws CLI was taking around 1.7s just to determine which region my buckets were in. Configuring the CLI to point to a default region overcame this (see the sketch just after this list)
  • Secondly, because PHP has to invoke a whole new shell when running an exec() command (eg exec("aws s3 cp s3://bucketname/objectname.txt /var/app_path/objectname.txt")), that is a very slow exercise. I know it's possible to offload shell commands via Gearman or similar, but since simplicity was one of my goals, I didn't want to go down that road
  • Finally, because the AWS CLI uses Python, it takes almost 0.4s just to start up before it even begins processing a command. That might not seem like a lot, but once my API is in production use it will be quite an impact on users and infrastructure alike
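
For the first point, the default region can be pinned so the CLI no longer has to discover the bucket's region on every call. Assuming the buckets are in ap-southeast-2 as in this question, either of the following works (aws configure set and the ~/.aws/config file are the standard CLI configuration mechanisms):

aws configure set default.region ap-southeast-2

or, equivalently, in ~/.aws/config:

[default]
region = ap-southeast-2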

To cut a long story short, I've done two things:

  • Reverted to using the AWS PHP SDK instead of the AWS CLI
  • Referring to the correct S3 region name within my PHP code (a minimal sketch of both changes follows below)
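
A minimal sketch of what those two changes look like together, using the AWS SDK for PHP v3; the bucket, key, and local path below are placeholders rather than values from the answer:

<?php
require 'vendor/autoload.php';

use Aws\S3\S3Client;

// Pinning the region up front avoids the ~1.7s bucket-region lookup
// described above; credentials come from the instance role automatically.
$s3 = new S3Client([
    'version' => 'latest',
    'region'  => 'ap-southeast-2',
]);

// Download an object straight to a local file (placeholders throughout).
$s3->getObject([
    'Bucket' => 'bucketname',
    'Key'    => 'objectname.txt',
    'SaveAs' => '/var/app_path/objectname.txt',
]);

Because this runs in-process, it also sidesteps the exec() shell spawn and the CLI's Python start-up cost described above.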

My APIs are now performing much better, ie from 2.3s down to an average of around 0.07s.

This doesn't make my original issue go away, but at least performance is much better.

I found that when I tried to download an object using aws s3 cp, the download would hang close to finishing whenever the object was larger than 500MB.

However, using get-object directly causes no hang or slowdown whatsoever. Therefore, instead of using

aws s3 cp s3://my-bucket/path/to/my/object .

getting the object with

aws s3api get-object --bucket my-bucket --key path/to/my/object out-file

I experience no slowdown.
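
One possible explanation, offered as an assumption rather than something this answer confirms: aws s3 cp downloads large objects with concurrent multipart range requests, whereas s3api get-object issues a single GET. If you would rather keep aws s3 cp, the AWS CLI's documented s3 transfer settings let you reduce or disable that concurrency, eg:

aws configure set default.s3.max_concurrent_requests 1
aws configure set default.s3.multipart_threshold 1GB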

AWS S3 is slow and painfully complex, and you can't easily search for files. If used with CloudFront it is faster and there are supposed to be advantages, but the complexity shifts from very complex to insanely complex: caching obfuscates any file changes, and invalidating the cache is hit and miss unless you change the file name, which in turn means changing the file name in every page referencing that file.

In practice, particularly if all or most of your traffic is located in the same region as your load balancer, I have found that even a low-specced web server in the same region is faster by a factor of 10. If you need multiple web servers attached to a common volume, AWS only provides this in certain regions, so I got around it by using NFS to share the volume across multiple web servers (a rough sketch follows below). This gives you a file system mounted on a server that you can log in to, and list and find files on. S3 has become a turnkey solution for a problem that was solved better a couple of decades ago.
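
A rough illustration of that NFS arrangement, with entirely hypothetical hostnames, addresses, and paths (the answer does not give its actual configuration): export the shared directory from the instance that owns the volume, then mount it from the other web servers.

# On the exporting instance (needs the nfs-kernel-server package), in /etc/exports:
/var/www 10.0.0.0/24(rw,sync,no_subtree_check)

# Re-export after editing /etc/exports:
sudo exportfs -ra

# On each other web server (10.0.0.10 is the hypothetical exporting host):
sudo mount -t nfs 10.0.0.10:/var/www /var/www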

You may try using boto3 to download files instead of aws s3 cp.

Refer to Downloading a File from an S3 Bucket.
