
Export DynamoDB data to S3 using AWS Data Pipeline

I have a DynamoDB table storing 1 GB of data, with RCU and WCU set to 1000 each. I set up a Data Pipeline to export this 1 GB of data to S3. The entire 1 GB is exported to S3 in partitions. My question is: what decides the number and size of these partitions?

mightyMouse

In his recent video "AWS re:Invent Amazon DynamoDB advanced design patterns – Part 1," Rick Houlihan demonstrates setting 100k WCU of provisioned capacity at table creation and mentions that each extra 1000 WCU provisioned adds 1 partition, so his table starts with around 100 partitions. Partitions are internally limited to 10 GB: a partition that hits this limit splits into two, but otherwise the data stays within one partition. This suggests that potentially all of your data still sits within a single partition.

All of this seems to be abstracted away from users, but in general you should be able to estimate the number of partitions.
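As a rough illustration of such an estimate, here is a minimal sketch. It assumes the per-partition limits mentioned above (1000 WCU and 10 GB) plus a 3000 RCU limit, which is not stated in this answer and comes from older AWS documentation. The function name `estimate_partitions` is made up for this example, and since AWS does not expose the real partition count, treat the result as a guess rather than a guarantee.

```python
import math

# Assumed per-partition limits. The WCU and size figures come from the
# answer above; the RCU figure is an assumption based on older AWS docs.
MAX_RCU_PER_PARTITION = 3000
MAX_WCU_PER_PARTITION = 1000
MAX_GB_PER_PARTITION = 10


def estimate_partitions(rcu, wcu, size_gb):
    """Estimate the partition count from provisioned throughput and table size."""
    # Partitions needed to serve the provisioned throughput.
    by_throughput = math.ceil(
        rcu / MAX_RCU_PER_PARTITION + wcu / MAX_WCU_PER_PARTITION
    )
    # Partitions needed to hold the stored data.
    by_size = math.ceil(size_gb / MAX_GB_PER_PARTITION)
    # A table always has at least one partition.
    return max(by_throughput, by_size, 1)


# The table from the question: 1000 RCU, 1000 WCU, 1 GB of data.
print(estimate_partitions(1000, 1000, 1))
```

Under these assumptions a small table like the one in the question comes out at only a partition or two, with provisioned throughput rather than data size as the deciding factor.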

With on-demand mode, AWS says the table will automatically accommodate up to double its previous peak throughput. The wording is a little odd, but I believe this ends up with AWS creating more partitions for you when a partition is read from or written to faster than it can handle.

Something to note: many people have mentioned that once partitions have been provisioned, you can lower the WCU to what you actually need and still keep the provisioned partitions.
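For reference, lowering the provisioned capacity afterwards could look roughly like the sketch below. It uses the DynamoDB `update_table` call as exposed by a boto3 client. The helper name `lower_provisioned_capacity` and the table name `my-table` are placeholders for this example, and whether the partitions are actually retained is only what people report, not something this code can verify.

```python
def lower_provisioned_capacity(client, table_name, rcu, wcu):
    """Set new provisioned throughput on a table via UpdateTable.

    `client` is expected to be a boto3 DynamoDB client (or anything with a
    compatible `update_table` method).
    """
    return client.update_table(
        TableName=table_name,
        ProvisionedThroughput={
            "ReadCapacityUnits": rcu,
            "WriteCapacityUnits": wcu,
        },
    )


# Usage (requires boto3 and AWS credentials; "my-table" is a placeholder):
#   import boto3
#   lower_provisioned_capacity(boto3.client("dynamodb"), "my-table", 1000, 1000)
```

Keep in mind that DynamoDB limits how often provisioned capacity can be decreased per day, so it is worth settling on the target values before making the call.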
