
Storm Crawler: Crawling websites that require authentication

I would like to use Storm Crawler to crawl intranet websites that require authentication (I already have the credentials). Can this be done simply by modifying the crawler configuration, or do I need to change classes in the source code? If so, which classes?

This is not currently available. I have opened issue #427 for it; you would need to modify the HttpProtocol class. This would be a great contribution and would be very welcome.
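As a rough illustration of what such a change might involve, here is a minimal sketch that builds the value of an `Authorization` header. It assumes the intranet sites use HTTP Basic authentication, which the question does not state. The class and method names (`BasicAuthSketch`, `basicAuthHeader`) are hypothetical and are not part of Storm Crawler; how the header would actually be attached to the request inside HttpProtocol is not shown.

```java
import java.nio.charset.StandardCharsets;
import java.util.Base64;

// Hypothetical helper, not part of Storm Crawler.
// Assumption: the target sites accept HTTP Basic authentication.
final class BasicAuthSketch {

    private BasicAuthSketch() {
        // static helper only
    }

    // Returns the value for an "Authorization" request header:
    // the word "Basic", a space, then base64("username:password").
    static String basicAuthHeader(String username, String password) {
        String credentials = username + ":" + password;
        String encoded = Base64.getEncoder()
                .encodeToString(credentials.getBytes(StandardCharsets.UTF_8));
        return "Basic " + encoded;
    }

    public static void main(String[] args) {
        // Placeholder credentials for demonstration only.
        System.out.println("Authorization: " + basicAuthHeader("user", "pass"));
    }
}
```

A modified protocol class could add this value as the `Authorization` header on each outgoing request. Sites that use form-based login or another scheme would need a different approach.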
