
Kinesis ProvisionedThroughputExceededException even after sufficient shards

Original 2016-04-25 06:56:08 5 3 amazon-web-services/ amazon-ec2/ amazon-kinesis

We are facing a ProvisionedThroughputExceededException while writing data to a Kinesis stream.

Case 1: We used a single m4.4xlarge (16 cores, 64 GB memory) instance to write data to the stream, sending 3k requests from JMeter. The EC2 instance handled 1,100 requests per second, so we chose a 2-shard stream (i.e. 2,000 eps). As a result, we were able to write data to the stream successfully without any loss.

Case 2: For further testing we created a cluster of 10 EC2 m4.4xlarge (16 cores, 64 GB memory) instances and an 11-shard stream (based on a simple calculation of 1,000 eps per shard, so 10 shards plus 1 spare). When we test that EC2 cluster with different request volumes from JMeter, such as 3, 10, and 30 million, we get ProvisionedThroughputExceededException errors in our log file.

On the JMeter side the EC2 cluster delivers 7,500 eps, and I believe a stream with 11,000 eps of capacity should not return such an error at 7,500 eps.

Could you help me understand the reason behind this issue?

3 answers

It sounds like Kinesis is not hashing/distributing your data evenly across your shards: some are "hot" (getting the ProvisionedThroughputExceededException), while others are "cold".
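To make the hot/cold idea concrete, here is a minimal sketch that estimates how a batch of partition keys would land on shards. Kinesis maps each partition key to a 128-bit integer with MD5, and each shard accepts at most 1,000 records per second for writes. The helper names (`shard_for_key`, `per_shard_load`, `hot_shards`) are made up for this illustration, and the sketch assumes the shards split the hash space into equal ranges, which you should verify against your stream's actual hash key ranges.

```python
import hashlib
from collections import Counter

HASH_SPACE = 2 ** 128  # Kinesis hash keys are 128-bit unsigned integers


def shard_for_key(partition_key, shard_count):
    """Index of the shard a partition key maps to.

    Assumption: the shards divide the hash space into equal, contiguous ranges.
    """
    hash_value = int(hashlib.md5(partition_key.encode("utf-8")).hexdigest(), 16)
    return hash_value * shard_count // HASH_SPACE


def per_shard_load(partition_keys, shard_count):
    """Number of records each shard would receive for the given keys."""
    counts = Counter(shard_for_key(key, shard_count) for key in partition_keys)
    return [counts.get(index, 0) for index in range(shard_count)]


def hot_shards(load, limit=1000):
    """Shard indexes whose record count exceeds the per-shard write limit."""
    return [index for index, count in enumerate(load) if count > limit]


if __name__ == "__main__":
    # One second of traffic at 7,500 eps, but with only 5 distinct keys.
    skewed_keys = [f"device-{i % 5}" for i in range(7500)]
    load = per_shard_load(skewed_keys, shard_count=11)
    print("records per shard:", load)
    print("shards over the limit:", hot_shards(load))
```

With only a handful of distinct keys, at most that many shards receive any traffic, so a stream can throttle well below its aggregate capacity.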

To solve this, I recommend:

Use the ExplicitHashKey parameter to control which shard each record goes to. The PutRecords documentation has some basic information on this (but not as much as it should).
Also, make sure that your shards are evenly split across the hash space (appropriate starting and ending hash keys).

The simplest pattern is to have a single pre-defined ExplicitHashKey for each shard and have your PutRecords logic cycle through them, one per record, which gives a perfectly even distribution. In any case, make sure your record hashing algorithm distributes records evenly across the shards.
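The round-robin pattern can be sketched as follows. This is an illustration under stated assumptions, not a definitive implementation: it assumes the shards split the 128-bit hash space into equal ranges and picks the midpoint of each range as that shard's key. The function names and the `pk-N` partition keys are hypothetical. The entry fields `Data`, `PartitionKey`, and `ExplicitHashKey` are the ones PutRecords accepts; ExplicitHashKey is passed as a decimal string, and PartitionKey is still required even when it is overridden.

```python
import itertools

HASH_SPACE = 2 ** 128  # Kinesis hash keys are 128-bit unsigned integers


def explicit_hash_keys(shard_count):
    """One decimal-string hash key per shard, at the midpoint of its range.

    Assumption: the shards divide the hash space into equal, contiguous ranges.
    """
    width = HASH_SPACE // shard_count
    return [str(index * width + width // 2) for index in range(shard_count)]


def build_entries(payloads, shard_count):
    """Build PutRecords-style entries, cycling through the per-shard keys."""
    keys = itertools.cycle(explicit_hash_keys(shard_count))
    return [
        {
            "Data": data,
            "PartitionKey": f"pk-{number}",  # placeholder; ExplicitHashKey decides the shard
            "ExplicitHashKey": next(keys),
        }
        for number, data in enumerate(payloads)
    ]


if __name__ == "__main__":
    entries = build_entries([b"event"] * 22, shard_count=11)
    for entry in entries[:3]:
        print(entry["PartitionKey"], entry["ExplicitHashKey"])
```

In a real producer it would be safer to read each shard's starting and ending hash key from the stream description rather than computing them, since shards that have been split or merged may not cover equal ranges.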

Another alternative/extension based on ExplicitHashKey is to dedicate a subset of your hash space to "overflow" shard(s). In your case, that means one specific ExplicitHashKey value mapped to one shard: when you start being throttled on your normal shards, send the records there for retry.
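A rough sketch of the overflow idea is shown here. Everything in it is hypothetical scaffolding: `Throttled` stands in for ProvisionedThroughputExceededException, and `send` is a caller-supplied function that writes one record with a given ExplicitHashKey. Note that the batch PutRecords call reports throttling per record in its response rather than raising, so a real batch producer would inspect the response instead of catching an exception.

```python
class Throttled(Exception):
    """Stand-in for ProvisionedThroughputExceededException in this sketch."""


def put_with_overflow(send, record, primary_hash_key, overflow_hash_key):
    """Write to the primary shard; if throttled, retry once on the overflow shard.

    `send(record, hash_key)` is a hypothetical callable supplied by the caller.
    Returns the hash key that accepted the record.
    """
    try:
        send(record, primary_hash_key)
        return primary_hash_key
    except Throttled:
        # The overflow shard can be throttled too; that error propagates.
        send(record, overflow_hash_key)
        return overflow_hash_key


if __name__ == "__main__":
    def demo_send(record, hash_key):
        # Pretend the shard behind hash key "1" is currently hot.
        if hash_key == "1":
            raise Throttled()

    print(put_with_overflow(demo_send, b"event", "1", "9"))
```

The overflow shard only helps while it has spare capacity itself, so it is a complement to even distribution, not a replacement for it.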

Check your producer side: are you sure you are inserting data into different shards? The "PartitionKey" value in the PutRecordRequest call may help you.

I think you need to pass different partition keys for your records to spread data across shards. Even if you have created multiple shards, if all of your records use the same partition key then you're still writing to a single shard, because they'll all have the same hash value. See the PartitionKey documentation for more.
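The effect of a constant partition key can be checked without touching a stream. The sketch below relies on the fact that Kinesis hashes partition keys with MD5; the helper names are invented for this example, and using a random UUID per record is just one possible way to get varied keys (it gives up per-key ordering, which may or may not matter for your data).

```python
import hashlib
import uuid


def hash_key(partition_key):
    """128-bit integer that a partition key hashes to (MD5)."""
    return int(hashlib.md5(partition_key.encode("utf-8")).hexdigest(), 16)


def with_random_partition_keys(payloads):
    """PutRecords-style entries with a fresh random partition key per record."""
    return [{"Data": data, "PartitionKey": uuid.uuid4().hex} for data in payloads]


def distinct_hash_values(entries):
    """How many different points in the hash space the entries occupy."""
    return len({hash_key(entry["PartitionKey"]) for entry in entries})


if __name__ == "__main__":
    payloads = [b"event"] * 100
    same_key = [{"Data": data, "PartitionKey": "my-app"} for data in payloads]
    print("constant key:", distinct_hash_values(same_key))
    print("random keys:", distinct_hash_values(with_random_partition_keys(payloads)))
```

A single hash value can only ever belong to one shard, which is why a constant key pins all traffic to that shard no matter how many shards the stream has.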


The technical posts on this site are licensed under CC BY-SA 4.0. If you reprint them, please cite the site URL or the original address. For any questions, please contact: yoyou2525@163.com.



solution1
1 2016-07-08 06:55:24

solution2
0 2016-04-26 08:43:57

solution3
0 2019-06-10 20:03:11
