简体   繁体   English

通过 Kinesis Stream 或 Firehose 写入 S3

[英]Writing to S3 via Kinesis Stream or Firehose

I have events that keep coming which I need to put to S3.我有不断发生的事件,我需要将它们放到 S3 中。 I am trying to evaluate if I muse use Kinesis Stream or Firehose.我正在尝试评估我是否使用 Kinesis Stream 或 Firehose。 I also want to wait for few minutes before writing to S3 so that the object is fairly full.我还想在写入 S3 之前等待几分钟,以便 object 相当满。

Based on my reading of Kinesis Data stream, I have to create an analytics app which will then be used to invoke a lambda.根据我对 Kinesis Data stream 的阅读,我必须创建一个分析应用程序,然后将其用于调用 lambda。 I will then have to use the lambda to write to S3.然后我将不得不使用 lambda 写入 S3。 Or Kinesis Data Streams can directly write to lambda somehow?或者 Kinesis Data Streams 可以以某种方式直接写入 lambda? I could not find anything indicating the same.我找不到任何表明相同的东西。

Firehose is not charged by hour(while stream is). Firehose 不按小时收费(而 stream 是)。 So is firehose a better option for me?那么firehose对我来说是更好的选择吗?

Or Kinesis Data Streams can directly write to lambda somehow?或者 Kinesis Data Streams 可以以某种方式直接写入 lambda?

Data Streams can't write directly to S3.数据流无法直接写入 S3。 Instead Firehose can do this:相反,Firehose可以这样做:

delivering real-time streaming data to destinations such as Amazon Simple Storage Service (Amazon S3) , Amazon Redshift, Amazon Elasticsearch Service (Amazon ES), Splunk, and any custom HTTP endpoint or HTTP endpoints owned by supported third-party service providers, including Datadog, MongoDB, and New Relic.向 Amazon Simple Storage Service (Amazon S3) 、Amazon Redshift、Amazon Elasticsearch Service (Amazon ES)、Splunk 和任何自定义 HTTP 端点或 Z293C9EA246FF9985DC6F62A6 等第三方拥有的端点服务提供商等目的地提供实时流数据Datadog、MongoDB 和 New Relic。

What's more Firehose allows you to buffer the records before writing them to S3.更重要的是,Firehose 允许您在将记录写入 S3 之前对其进行缓冲 The writing can happen based on buffer size or time.写入可以根据缓冲区大小或时间发生。 In addition to that you can process the records using lambda function before writing to S3.除此之外,您可以在写入 S3 之前使用 lambda function 处理记录

Thus, colectively it seems that Firehose is more suited to your use-case then Data Streams.因此,总的来说,Firehose 似乎比 Data Streams 更适合您的用例。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM