简体   繁体   English

什么是Java的优秀Web搜索和Web爬网引擎?

[英]What is a good Web search and web crawling engine for Java?

I am working on an application where I need to integrate the search engine. 我正在开发一个我需要集成搜索引擎的应用程序。 This should do crawling also. 这也应该做爬行。 Please suggest a good Java based search engine. 请建议一个好的基于Java的搜索引擎。

Thank you in advance. 先感谢您。

Nutch( Lucene )是一个开源引擎,可以满足您的需求。

In the past I worked with terrier , a search engine written in Java: 在过去,我曾与一个用Java编写的搜索引擎terrier一起工作:

Terrier is a highly flexible, efficient, effective, and robust search engine, readily deployable on large-scale collections of documents. Terrier是一种高度灵活,高效,有效且强大的搜索引擎,可轻松部署在大型文档集上。 Terrier implements state-of-the-art indexing and retrieval functionalities. Terrier实现了最先进的索引和检索功能。 Terrier provides an ideal platform for the rapid development of large-scale retrieval applications. Terrier为大规模检索应用的快速开发提供了理想的平台。

I've spent the last 2 years developing our own high performance search engine with C. For Java I highly suggest Apache Lucene as Ajay mentioned above. 我花了两年时间用C开发我们自己的高性能搜索引擎。对于Java,我强烈建议将Apache Lucene用作上面提到的Ajay。 For Java, it's the best in terms of speed, relevancy and features. 对于Java,它在速度,相关性和功能方面是最好的。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM