Elasticsearch index architecture for large data with frequent update/delete operations

I have an index that currently holds almost 50 GB of data, and it will soon exceed 100 GB. I would like to set up an index architecture that gives better performance.

I have looked into many things, one of them being Index Lifecycle Management, but documents in my index can be updated at any time. In that case, how can I design my index so that it performs well?

Another thing: I found an article, Dynamic Indices, on updating and deleting records in an index. It shows that updated and deleted records are still encountered whenever we perform a search operation. In my case I have a large number of updated records, so this will reduce the performance of the index.

How can we improve index performance when we have a large amount of data and many update and delete operations? What architecture should we follow?

Is all of your data likely to be updated or deleted, or only the latest data? If your updates are on fairly recent data and the old data is read-only, you can create the hot-warm-cold architecture as described in this blog post.
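As a rough illustration of the hot-warm-cold idea, here is a minimal Python sketch that only builds the JSON body of an ILM policy. It assumes the nodes are tagged with a custom attribute named `data` (for example `node.attr.data: warm` in `elasticsearch.yml`). That attribute name, the helper `build_ilm_policy`, and every threshold are assumptions chosen for illustration, not values from the question.

```python
import json


def build_ilm_policy(warm_after="7d", cold_after="30d"):
    """Return an ILM policy body that moves ageing indices hot -> warm -> cold.

    Assumption: nodes carry a custom attribute `data` set to hot/warm/cold.
    All sizes and ages below are placeholders to tune for your workload.
    """
    return {
        "policy": {
            "phases": {
                "hot": {
                    "actions": {
                        # Roll over to a fresh index once the current one
                        # grows too large or too old.
                        "rollover": {"max_size": "50gb", "max_age": "7d"}
                    }
                },
                "warm": {
                    "min_age": warm_after,
                    "actions": {
                        # Relocate shards to nodes tagged data=warm.
                        "allocate": {"require": {"data": "warm"}},
                        # Only sensible once the index is no longer written to.
                        "forcemerge": {"max_num_segments": 1},
                    },
                },
                "cold": {
                    "min_age": cold_after,
                    "actions": {"allocate": {"require": {"data": "cold"}}},
                },
            }
        }
    }


if __name__ == "__main__":
    print(json.dumps(build_ilm_policy(), indent=2))
```

The printed body could be sent with `PUT _ilm/policy/<policy-name>`, where the policy name is yours to choose. This layout only pays off if older indices really are read-only, as the paragraph above assumes.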

If all of your data is likely to be updated, you can use a hot-warm architecture, where all your updates go to the hot node and all queries go to the warm node. Elastic will sync the hot and warm nodes to achieve eventual consistency, so I assume you might have to live with stale data for a few milliseconds. Check this.

In my experience, Elastic can easily handle 50-100 GB of data even if you update and search the same set of indices and nodes. It all depends on the rate of updates and searches.
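One way to gauge whether the update rate is actually hurting is to look at how many deleted documents the index is carrying. An update in Elasticsearch writes a new version of the document and marks the old one as deleted until a segment merge removes it. The sketch below assumes you have already fetched the response of `GET <index>/_stats/docs` as a Python dict. The helper name `deleted_ratio`, the index name, and the sample numbers are hypothetical.

```python
def deleted_ratio(stats, index):
    """Fraction of documents in the primaries that are marked as deleted.

    `stats` is assumed to be the parsed JSON of `GET <index>/_stats/docs`.
    """
    docs = stats["indices"][index]["primaries"]["docs"]
    total = docs["count"] + docs["deleted"]
    return docs["deleted"] / total if total else 0.0


# Hypothetical sample response, trimmed to the fields used above.
sample = {
    "indices": {
        "my-index": {"primaries": {"docs": {"count": 900, "deleted": 100}}}
    }
}

if __name__ == "__main__":
    print(deleted_ratio(sample, "my-index"))  # prints 0.1
```

If the ratio stays low, background merges are keeping up and a single set of nodes is probably fine at this data size. A persistently high ratio suggests the update rate is the thing to look at first.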


 