
Nutch: how to crawl all links from one website?

For now I use the following commands to crawl a website:

 bin/nutch generate -topN 20
 bin/nutch fetch -all
 bin/nutch parse -all
 bin/nutch updatedb

But with this method it takes ages before I have all the links from that website. I want to crawl a single website and get all of its links.

How can I achieve this?

bin/nutch crawl is the command you are looking for.
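Each generate/fetch/parse/updatedb cycle only reaches links discovered in earlier cycles, so a single pass with -topN 20 covers very little of a site. As a rough sketch, the four commands from the question can be repeated in a loop with a larger -topN. The names crawl_rounds, rounds, top_n and NUTCH are hypothetical and not part of Nutch, and the sketch assumes the seed URLs have already been injected and that it is run from the Nutch installation directory.

```shell
#!/bin/sh
# Sketch: repeat the generate/fetch/parse/updatedb cycle from the question.
# crawl_rounds, rounds, top_n and NUTCH are illustrative names, not Nutch features.

crawl_rounds() {
    rounds=$1                    # number of cycles to run (roughly, link depth)
    top_n=$2                     # maximum number of URLs selected per cycle
    nutch=${NUTCH:-bin/nutch}    # path to the nutch launcher (assumed default)

    i=1
    while [ "$i" -le "$rounds" ]; do
        echo "round $i of $rounds" >&2
        # Same four steps as in the question; stop on the first failure.
        "$nutch" generate -topN "$top_n" || return 1
        "$nutch" fetch -all || return 1
        "$nutch" parse -all || return 1
        "$nutch" updatedb || return 1
        i=$((i + 1))
    done
}
```

For example, crawl_rounds 10 1000 would run ten cycles with up to 1000 URLs each. Both numbers are placeholders and would need tuning to the size of the site.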

