简体   繁体   中英

Are there any preferences to take snapshots of individual indices or to take snapshots of the whole cluster in Elasticsearch?

I am trying to understand if in Elasticsearch, there are any advantages to take snapshots of individual indices in comparison to taking snapshots of the whole cluster in terms of performance, hardware usage (CPU/RAM/Disk/Network) and comfortable restoring process of the snapshot in the future. Can anyone describe it to me?

Maybe this answer does not evaluate the two cases from all points of view. But I want to share one point: since it is mentioned in the Elasticsearch documentation that snapshots are taken from segments and snapshots are incrementally taken (only new segments are taken in new snapshots) [1], so it senses that taking a snapshot from n indices should not generally differ from taking n snapshots from each index (I'm not sure about exceptions such as metadata and feature states).

But there might be a limitation in taking a lot of snapshots from individual indices: if number of snapshots increases, memory of the master node and the cluster performance might be affected. It is mentioned in the Elasticsearch documentation where setting a retention for snapshots in configuring SLM is suggested [2].

[1] https://www.elastic.co/guide/en/elasticsearch/reference/current/snapshot-restore.html

[2] https://www.elastic.co/guide/en/elasticsearch/reference/current/snapshots-take-snapshot.html

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM