简体繁体 English

如果我有大量数据，我应该创建多数据 stream 而不是单个数据 stream

[英]If i have large amount of data, should i create multi data stream instead of a single data stream

原文 2022-01-06 03:42:58 9 1 elasticsearch/ data-stream

If the storage capacity is about several Trillionbyte, should i use a single data stream?如果存储容量大约是几万亿字节，我应该使用单个数据 stream 吗？ like this:像这样：

data stream aaa, contains index:aaa-2022.01.06-0001,aaa-2022.01.06-0002,aaa-2022.01.07-0003数据 stream aaa，包含索引：aaa-2022.01.06-0001,aaa-2022.01.06-0002,aaa-2022.01.07-0003

or several data stream或几个数据 stream

data stream one: aaa-2022.01.06,constains index:aaa-2022.01.06-2022.01.06-0001数据 stream 一：aaa-2022.01.06，包含索引：aaa-2022.01.06-2022.01.06-0001

data stream two: aaa-2022.01.07,constains index:aaa-2022.01.07-2022.01.07-0001数据 stream 两个：aaa-2022.01.07，包含索引：aaa-2022.01.07-2022.01.07-0001

1 个解决方案

Clearly the former as data streams are managed by ILM policies and automatically name their underlying indexes with the index creation date (ie .ds-<data-stream>-<yyyy.MM.dd>-<generation> ), so you wouldn't also add the current date in the data stream name itself.显然，前者是由 ILM 策略管理的数据流，并使用索引创建日期（即.ds-<data-stream>-<yyyy.MM.dd>-<generation> ）自动命名它们的基础索引，所以你不会t 还在数据 stream 名称本身中添加当前日期。

Just define the adequate ILM policy for your data stream (with proper rollover period and/or size and retention) and you're good.只需为您的数据 stream 定义适当的 ILM 策略（具有适当的翻转期和/或大小和保留），您就可以了。

如何创建作业以将Elasticsearch索引中的所有文档作为数据流处理？ - How can I create a job to process all documents in an elasticsearch index as a data stream?

将大量数据编入Elasticsearch中 - Index large amount of data into elasticsearch

我应该使用Elastic Search作为我的数据存储而不是MySQL吗？ - Should I use Elastic Search as my data store instead of MySQL?

我在 elasticsearch 中创建了一个多字段，但它没有返回任何数据 - I have created a multi field in elastisearch but it's not returning any data

ELK 重新索引数据流到位 - ELK reindex data stream in place

通过logstash创建数据stream - creating data stream through logstash

Akka 流停止处理数据 - Akka stream stops processing data

如何在 python 客户端上创建 Elasticsearch 数据流模板？ - How to create Elasticsearch data stream template on a python's client?

如何从 Kinesis firehose 将 stream 数据传输到自托管 elasticsearch 集群？ - How can I stream data to self hosted elasticsearch cluster from Kinesis firehose?

如何快速聚合大量数据 - How to quickly aggregate large amount of data

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 如何创建作业以将Elasticsearch索引中的所有文档作为数据流处理？ - How can I create a job to process all documents in an elasticsearch index as a data stream? 将大量数据编入Elasticsearch中 - Index large amount of data into elasticsearch 我应该使用Elastic Search作为我的数据存储而不是MySQL吗？ - Should I use Elastic Search as my data store instead of MySQL? 我在 elasticsearch 中创建了一个多字段，但它没有返回任何数据 - I have created a multi field in elastisearch but it's not returning any data ELK 重新索引数据流到位 - ELK reindex data stream in place 通过logstash创建数据stream - creating data stream through logstash Akka 流停止处理数据 - Akka stream stops processing data 如何在 python 客户端上创建 Elasticsearch 数据流模板？ - How to create Elasticsearch data stream template on a python's client? 如何从 Kinesis firehose 将 stream 数据传输到自托管 elasticsearch 集群？ - How can I stream data to self hosted elasticsearch cluster from Kinesis firehose? 如何快速聚合大量数据 - How to quickly aggregate large amount of data

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM