for now i use the follow commands to crawl a website:
bin/nutch generate -topN 20
bin/nutch fetch -all
bin/nutch parse -all
bin/nutch updatedb
but with this method it takes ages before i have all links from that website. I want to crawl one website and get all the links.
how can i achieve this?
bin/nutch crawl
是您要查找的命令
The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.