简体   繁体   English

分布式Lucene.NET

[英]Distributed Lucene.NET

I have a Terabyte of data, maybe more, which I'd like to index and search with Lucene. 我有1 TB的数据,也许还有更多,我想用Lucene进行索引和搜索。 I'd like to be able to split the index out to different machines, similar to what Solr does (if I understand Solr correctly). 我希望能够将索引拆分到不同的机器上,类似于Solr所做的(如果我正确理解Solr的话)。

Are there any existing tools to do this on the Windows platform? Windows平台上是否有任何现有工具可以执行此操作?

Thanks! 谢谢!

Edit : I'm not very keen on running Java Lucene. 编辑 :我不是很热衷于运行Java Lucene。 I will most likely be making my own tweaks to Lucene so I have to stick to Lucene.Net since I don't know much about Java 我很可能会对Lucene进行自己的调整,因此我必须坚持使用Lucene.Net,因为我对Java不太了解

What you're looking for is Katta . 您正在寻找的是Katta Here's a graph of how it works: 这是它如何工作的图形: 卡塔
(source: sourceforge.net ) (来源: sourceforge.net

But since you already know Solr, why not just use its sharding capabilities directly? 但是,既然您已经了解Solr,为什么不直接使用其分片功能呢?

据我所知,没有将MultiPassIndexSplitter( http://lucene.apache.org/java/3_0_0/api/contrib-misc/org/apache/lucene/index/MultiPassIndexSplitter.html )类移植到Lucene.net,因此此功能可能尚未实现。

Solr is a Java app so it runs on Windows. Solr是Java应用程序,因此可以在Windows上运行。 You can find details on how to configure it as a Windows service here: http://blog.ianbattersby.com/archive/2010/02/09/apache-solr-as-a-windows-service 您可以在此处找到有关如何将其配置为Windows服务的详细信息: http : //blog.ianbattersby.com/archive/2010/02/09/apache-solr-as-a-windows-service

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM