[英]How does ElasticSearch handle an index with 230m entries?
I was looking through elasticsearch and was noticing that you can create an index and bulk add items. 我一直在寻找Elasticsearch,并注意到可以创建索引并批量添加项目。 I currently have a series of flat files with 220 million entries.
我目前拥有一系列带有2.2亿条目的平面文件。 I am working on Logstash to parse and add them to ElasticSearch, but I feel that it existing under 1 index would be rough to query.
我正在使用Logstash进行分析并将其添加到ElasticSearch,但是我觉得它在1索引以下的存在将很难查询。 The row data is nothing more than 1-3 properties at most.
行数据最多不过是1-3个属性。
How does Elasticsearch function in this case? 在这种情况下,Elasticsearch如何起作用? In order to effectively query this index, do you just add additional instances to the cluster and they will work together to crunch the set?
为了有效地查询该索引,您是否只是将其他实例添加到集群中,它们将一起工作以处理集合?
I have been walking through the documentation, and it is explaining what to do, but not necessarily all the time explaining why it does what it does. 我一直在浏览文档,它在解释要做的事情,但不一定总是在解释为什么要这样做。
In order to effectively query this index, do you just add additional instances to the cluster and they will work together to crunch the set?
为了有效地查询该索引,您是否只是将其他实例添加到集群中,它们将一起工作以处理集合?
That is exactly what you need to do. 那正是您需要做的。 Typically it's an iterative process:
通常,这是一个迭代过程:
Since you mention Logstash, there are a few things that may help further: 既然您提到Logstash,那么有些事情可能会进一步帮助您:
Good luck! 祝好运!
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.