简体繁体中英

Python BS4 Scraping Script Timer

原文 2016-11-29 04:26:28 2 1 python/ web-scraping/ beautifulsoup/ bs4

I have been trying to get this web scraping script working properly, and am not sure what to try next. Hoping someone here knows what I should do.

I am using BS4 and the problem is whenever a URL takes a long time to load it skips over that URL (leaving an output file with fewer inputs in times of high page load times). I have been trying to add on a timer so that it only skips over the url if it doesn't load in x seconds.

Can anyone point me in the right direction?

Thanks!

1 answers

尝试使用多线程或多处理来生成线程，我认为它将为每个请求生成一个线程，并且如果花费的时间太长，它也不会跳过URL。

Python, Scraping BS4

Python 3 scraping with Bs4

bs4 python web scraping

scraping using BS4 python

Scraping Table With Python/BS4

Scraping image with bs4 python

Scraping Script tag in HTML with Json and BS4

Scraping a script written in JS with BS4

Web Scraping using bs4 with Python

Python - super simple scraping with request and bs4

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question Python, Scraping BS4 Python 3 scraping with Bs4 bs4 python web scraping scraping using BS4 python Scraping Table With Python/BS4 Scraping image with bs4 python Scraping Script tag in HTML with Json and BS4 Scraping a script written in JS with BS4 Web Scraping using bs4 with Python Python - super simple scraping with request and bs4

Related Tags

Python BS4 Scraping Script Timer

Question

1 answers

solution1 0 2016-11-29 07:32:10

solution1
0 2016-11-29 07:32:10