简体   繁体   English

是否有 AWS 点中断率随时间变化的数据?

[英]Is there data for AWS spot interruption rate over time?

We are running an EMR cluster with spot instances as task nodes.我们正在运行一个以 spot 实例作为任务节点的 EMR 集群。 The EMR cluster is executing spark jobs which sometimes run for several hours. EMR 集群正在执行有时会运行几个小时的 spark 作业。 Interruptions of spot instances can cause the failure of the spark job which then requires us to restart the job entirely. spot 实例的中断会导致 spark 作业失败,然后需要我们完全重新启动该作业。

I can see that there is some basic information on the "Frequency of interruption" on AWS Spot Advisor - However, this data seems to be very generic, I can't see historic trends and I also miss the probability of interruption based on how long the spot instance is running (which should have a significant impact on the probability of interruption).我可以看到AWS Spot Advisor上有一些关于“中断频率”的基本信息——但是,这个数据似乎很笼统,我看不到历史趋势,我也错过了基于多长时间中断的概率Spot 实例正在运行(这应该对中断概率有重大影响)。

Is this data available somewhere?这些数据在某处可用吗? Or are there other data points that can be used as proxy?或者是否有其他数据点可以用作代理?

I found this Github issue which provides a link to this JSON file in Spot Advisor S3 bucket that includes interruption rates.我发现了这个 Github问题,它提供了指向 Spot Advisor S3 存储桶中的这个 JSON 文件的链接,其中包括中断率。

https://spot-bid-advisor.s3.amazonaws.com/spot-advisor-data.json https://spot-bid-advisor.s3.amazonaws.com/spot-advisor-data.json

AWS Spot Instances are spare computing capacity in the AWS Cloud that is available at discounted prices. AWS Spot 实例是 AWS 云中以折扣价提供的备用计算容量。 These instances can be interrupted by Amazon EC2 with two minutes of notification when the capacity is needed elsewhere.当别处需要容量时,Amazon EC2 可以通过两分钟的通知中断这些实例。

AWS provides data on the Spot Instance interruption rate as a percentage of the total number of instances in the region. AWS 提供有关 Spot 实例中断率的数据,以占该地区实例总数的百分比表示。 This data is available in the AWS Management Console and can be accessed by going to the EC2 dashboard and selecting "Spot Request History" under the "Spot Requests" tab.此数据在 AWS 管理控制台中可用,可以通过转到 EC2 仪表板并选择“Spot Requests”选项卡下的“Spot Request History”来访问。

You can also use the AWS CLI or the AWS SDKs to retrieve this data programmatically.您还可以使用 AWS CLI 或 AWS 开发工具包以编程方式检索此数据。 The describe-spot-instance-request-history command in the AWS CLI can be used to retrieve the Spot Instance interruption rate for a specific region and time period. AWS CLI 中的 describe-spot-instance-request-history 命令可用于检索特定区域和时间段的 Spot 实例中断率。

Here is an example of how you can use the AWS CLI to retrieve the Spot Instance interruption rate for the us-east-1 region for the past week:以下是如何使用 AWS CLI 检索 us-east-1 区域过去一周的 Spot 实例中断率的示例:

aws ec2 describe-spot-instance-request-history --region us-east-1 --start-time $(date -v-7d '+%Y-%m-%dT%T')

Note that the Spot Instance interruption rate can vary over time depending on the demand for computing capacity in the region and the availability of spare capacity.请注意,Spot 实例中断率可能会随时间变化,具体取决于该地区对计算容量的需求和备用容量的可用性。 It is also important to note that the interruption rate is not guaranteed and can vary based on the instance type, Availability Zone, and other factors.还需要注意的是,中断率无法保证,可能会因实例类型、可用区和其他因素而异。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM