简体   繁体   English

如果我有大量数据,我应该创建多数据 stream 而不是单个数据 stream

[英]If i have large amount of data, should i create multi data stream instead of a single data stream

If the storage capacity is about several Trillionbyte, should i use a single data stream?如果存储容量大约是几万亿字节,我应该使用单个数据 stream 吗? like this:像这样:

data stream aaa, contains index:aaa-2022.01.06-0001,aaa-2022.01.06-0002,aaa-2022.01.07-0003数据 stream aaa,包含索引:aaa-2022.01.06-0001,aaa-2022.01.06-0002,aaa-2022.01.07-0003

or several data stream或几个数据 stream

data stream one: aaa-2022.01.06,constains index:aaa-2022.01.06-2022.01.06-0001数据 stream 一:aaa-2022.01.06,包含索引:aaa-2022.01.06-2022.01.06-0001

data stream two: aaa-2022.01.07,constains index:aaa-2022.01.07-2022.01.07-0001数据 stream 两个:aaa-2022.01.07,包含索引:aaa-2022.01.07-2022.01.07-0001

Clearly the former as data streams are managed by ILM policies and automatically name their underlying indexes with the index creation date (ie .ds-<data-stream>-<yyyy.MM.dd>-<generation> ), so you wouldn't also add the current date in the data stream name itself.显然,前者是由 ILM 策略管理的数据流,并使用索引创建日期(即.ds-<data-stream>-<yyyy.MM.dd>-<generation> )自动命名它们的基础索引,所以你不会t 还在数据 stream 名称本身中添加当前日期。

Just define the adequate ILM policy for your data stream (with proper rollover period and/or size and retention) and you're good.只需为您的数据 stream 定义适当的 ILM 策略(具有适当的翻转期和/或大小和保留),您就可以了。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM