Using Nutch to Retrieve Page Contents
I have a very large list of seeds to crawl. Only the seed pages themselves are needed, without crawling any deeper. How can I use Nutch to retrieve the seed pages, without any indexing or integration into another platform such as Solr?
Thanks
Well, there are several issues you want to address. Below are the issues with their solutions: