[英]If i have large amount of data, should i create multi data stream instead of a single data stream
If the storage capacity is about several Trillionbyte, should i use a single data stream?如果存储容量大约是几万亿字节,我应该使用单个数据 stream 吗? like this:
像这样:
data stream aaa, contains index:aaa-2022.01.06-0001,aaa-2022.01.06-0002,aaa-2022.01.07-0003数据 stream aaa,包含索引:aaa-2022.01.06-0001,aaa-2022.01.06-0002,aaa-2022.01.07-0003
or several data stream或几个数据 stream
data stream one: aaa-2022.01.06,constains index:aaa-2022.01.06-2022.01.06-0001数据 stream 一:aaa-2022.01.06,包含索引:aaa-2022.01.06-2022.01.06-0001
data stream two: aaa-2022.01.07,constains index:aaa-2022.01.07-2022.01.07-0001数据 stream 两个:aaa-2022.01.07,包含索引:aaa-2022.01.07-2022.01.07-0001
Clearly the former as data streams are managed by ILM policies and automatically name their underlying indexes with the index creation date (ie .ds-<data-stream>-<yyyy.MM.dd>-<generation>
), so you wouldn't also add the current date in the data stream name itself.显然,前者是由 ILM 策略管理的数据流,并使用索引创建日期(即
.ds-<data-stream>-<yyyy.MM.dd>-<generation>
)自动命名它们的基础索引,所以你不会t 还在数据 stream 名称本身中添加当前日期。
Just define the adequate ILM policy for your data stream (with proper rollover period and/or size and retention) and you're good.只需为您的数据 stream 定义适当的 ILM 策略(具有适当的翻转期和/或大小和保留),您就可以了。
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.