
Does Elasticsearch/Lucene impose memory overhead for missing values in the fieldcache?

This question is for Elasticsearch primarily, but I believe the answer will be based on underlying Lucene semantics.

I'm contemplating using multiple types in the same index. Many fields will be sortable, and many fields will only be used by one particular type. I.e., fields will be sparse, with say 10% coverage on average.

Since sorting keeps values for all docs in memory (regardless of type), I'd like to know if there's any memory overhead with regard to missing field values (the ~90% in my case).
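For concreteness, here is a minimal sketch of the scenario using the pre-5.x multi-type mapping API; the index name, type names, field names, and the localhost endpoint are all hypothetical:

```python
import requests

# Hypothetical setup: one index, two types, each with its own sortable
# field -- so each field is populated for only a fraction of the docs.
index_body = {
    "mappings": {
        "product": {
            "properties": {
                "price": {"type": "long"}      # only ever set on products
            }
        },
        "review": {
            "properties": {
                "rating": {"type": "double"}   # only ever set on reviews
            }
        }
    }
}

# Pre-5.x Elasticsearch allowed multiple mapping types per index.
resp = requests.put("http://localhost:9200/my_index", json=index_body)
print(resp.json())

# The question: sorting reviews by "rating" builds per-document,
# index-wide structures -- do the product docs (which never have a
# rating) still cost memory there?
```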

In a recent blog post on the official Elasticsearch blog titled "Index vs Type", the author tackles a common problem: choosing whether to model one's data using several indices or several types.

One key fact is that Lucene indices don't handle sparsity well. As a result, the author says that

Fields that exist in one type will also consume resources for documents of types where this field does not exist. [...] And the issue is even worse with doc values: for speed reasons, doc values often reserve a fixed amount of disk space for every document, so that values can be addressed efficiently.
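To see why fixed-width storage makes lookups fast but penalizes sparse fields, here is an illustrative sketch of a dense, fixed-width numeric column; it is not Lucene's actual doc values format, just the addressing idea:

```python
import struct

class DenseNumericColumn:
    """Illustrative fixed-width column: one 8-byte slot per document,
    whether or not the document has a value (not Lucene's real format)."""

    WIDTH = 8  # bytes per value

    def __init__(self, num_docs):
        # Space is reserved for every doc up front: docs with no value
        # for this field still occupy a full slot.
        self.data = bytearray(num_docs * self.WIDTH)

    def set(self, doc_id, value):
        struct.pack_into("<q", self.data, doc_id * self.WIDTH, value)

    def get(self, doc_id):
        # O(1) addressing: a value's offset is simply doc_id * WIDTH,
        # which is exactly why fixed-width layouts are fast to read.
        return struct.unpack_from("<q", self.data, doc_id * self.WIDTH)[0]

# 1,000,000 docs where only 10% ever set a value still reserve all 8 MB:
col = DenseNumericColumn(1_000_000)
print(len(col.data))  # 8000000 bytes, regardless of actual coverage
```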

There is a Lucene issue aimed at improving this situation; it has been fixed in Lucene 5.4 and will be available in Elasticsearch v2.2. Even then, the author advises modeling your data in a way that limits sparsity as much as possible.
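One way to limit sparsity, sketched below with the same hypothetical names as above, is to give each document kind its own index so that every mapped field is dense within its index:

```python
import requests

# Hypothetical alternative: one index per document kind, so each
# mapped field is populated for (nearly) all docs in its index.
requests.put("http://localhost:9200/products", json={
    "mappings": {"product": {"properties": {"price": {"type": "long"}}}}
})
requests.put("http://localhost:9200/reviews", json={
    "mappings": {"review": {"properties": {"rating": {"type": "double"}}}}
})

# A sort on "rating" now only touches the reviews index, where the
# field has ~100% coverage instead of ~10%.
resp = requests.get("http://localhost:9200/reviews/_search", json={
    "sort": [{"rating": "desc"}]
})
print(resp.json())
```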
