[英]ElasticSearch overhead over Lucene + custom clustering solution
I've experience of working on project where full-text search speed was boosted by replacement of ElasticSearch with Lucene + Hazelcast. 我有一个项目工作经验,通过用Lucene + Hazelcast取代ElasticSearch来提高全文搜索速度。
What may be the reasons of ElasticSearch overhead over Lucene + Hazelcast? 与Lucene + Hazelcast相比,ElasticSearch开销的原因是什么? Which ElasticSearch configs may cause for significant slowdown with the same resources?
哪些ElasticSearch配置可能导致相同资源的显着减速?
IndexWriter
to write separate index for each node working only with a local file system. IndexWriter
为每个仅使用本地文件系统的节点编写单独的索引。 Means each AVRO file will form one index per node. StringField
StringField
My reasons for using ES in this case would be 我在这种情况下使用ES的原因是
Future needs for project to explore data in more ways 项目未来需要以更多方式探索数据
Feature rich Aggregations API 功能丰富的Aggregations API
Support for Indexing using Spark / Hive etc - very easy to do and we can use pre processing of data efficiently. 使用Spark / Hive等支持索引 - 非常容易,我们可以有效地使用数据的预处理。
Auto Scaling / Adjust # of replications based on demand Auto Scaling /根据需求调整复制次数
and of course , not maintaining codebase to do all these. 当然,不保持代码库来完成所有这些。 This thread will be good discussion if you can add some expectations on flexibility from your end.
如果您可以从最终添加对灵活性的一些期望,那么这个主题将是很好的讨论。
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.