简体   繁体   中英

By passing incapsula security- python

I am trying to crawl the football data from the whoscored.com, the website has incapsula web oriented security which is not letting me crawl. Initially, I tried to give user_agent and changed the header then it worked but that's only for the first page. As I need to crawl some other parts of the website it keeps blocking me to request the website. It's getting exhausting now since the blocking time has been increasing.

Is there anybody who could suggest something to bypass the incapsula security mechanisms.

I need data for study purposes.

I have gone through all the old the previous question asked about this topic. but it does not help.

Tools. anaconda, language python, and library beautiful soup

Thanks.

if you mimic browser headers and appropiate time between requests, it will probably work

look at your request headers and that of your browser

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM