简体   繁体   English

通过 AWS Kinesis Data Analytics 使用 DynamoDB Streams

[英]Consuming DynamoDB Streams with AWS Kinesis Data Analytics

I am using Kinesis Firehose to consume Dyanamo DB streams through lambda and pushing those records to S3 bucket, Glue job is running every hour to pick the records from S3 , perform deduplication and then finally insert the records to Redshift.我正在使用 Kinesis Firehose 通过 lambda 使用 Dyanamo DB 流并将这些记录推送到 S3 存储桶,Glue 作业每小时运行一次以从 S3 中选取记录,执行重复数据删除,然后最终将记录插入到 Redshift。

在此处输入图片说明

Is there any way I can consume the records from Dynamo Streams to 'Kinesis Data Analytics' and then perform deduplication here and insert the records in Redshift?有什么方法可以将 Dynamo Streams 中的记录用于“Kinesis Data Analytics” ,然后在此处执行重复数据删除并将记录插入 Redshift 中?

I have gone through some links https://issues.apache.org/jira/browse/FLINK-4582 , Consume DynamoDB streams in Apache Flink .我浏览了一些链接https://issues.apache.org/jira/browse/FLINK-4582在 Apache Flink 中使用 DynamoDB 流

  • Here it is mentioned that we can use FlinkKinesisConsumer to consume DynamoDB streams这里提到我们可以使用FlinkKinesisConsumer来消费 DynamoDB 流

. . So Can we use this FlinkKinesisConsumer in Kinesis Data Analytics and then consume the Dynamo Stream directly?那么我们可以在 Kinesis Data Analytics 中使用这个 FlinkKinesisConsumer,然后直接消费 Dynamo Stream 吗?

While using Flink as Runtime for Kinesis Data Analytics.使用 Flink 作为 Kinesis Data Analytics 的运行时。

sources : https://docs.aws.amazon.com/kinesisanalytics/latest/java/how-sources.html来源https : //docs.aws.amazon.com/kinesisanalytics/latest/java/how-sources.html

'FlinkKinesisConsumer' can be used to adapt the Dynamo DB Streams ( https://issues.apache.org/jira/browse/FLINK-4582 ). 'FlinkKinesisConsumer' 可用于调整 Dynamo DB Streams ( https://issues.apache.org/jira/browse/FLINK-4582 )。

destinations: https://docs.aws.amazon.com/kinesisanalytics/latest/java/how-sinks.html目的地: https : //docs.aws.amazon.com/kinesisanalytics/latest/java/how-sinks.html

'FlinkKinesisFirehoseProducer ' can be used to write into 'Kinesis data firehose'. “FlinkKinesisFirehoseProducer”可用于写入“Kinesis data firehose”。 There is no direct integration with Redshift.没有与 Redshift 直接集成。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM