简体繁体中英

Scraping Information from multiple URLS that are different in structure

原文 2021-05-11 09:39:33 9 1 python/ web/ web-scraping/ beautifulsoup

I would like to scrape multiple URLS but they are of different nature, such as different company websites with different html backend. Is there a way to do it without coming up with a customised code for each url?

Understand that I can put multiple URLS into a list and loop them

1 answers

I fear not, but I am not an expert:-)

I could imagine that it depends on the complexity of the structures. If you want to find a the text "Test" on every website, I coul imagine that soup.body.findAll(text='Test') would return all occurences of "Test" on the website.

I assume you're aware of how to loop through a list here, so that you'd loop through the list of URLS and for each check whether the searched string occurs (maybe you are looking for sth else, ie an "apply" button or "login"?

all the best,

Scraping multiple single pages from different domains(mostly) with different structure

Scraping tables from Multiple URLs

Scraping different variables from multiple URLs into one single CSV file using Python

Looping multiple URLs <python scraping issue> ( 2 different URL's from same website)

Selenium scraping with multiple urls

Selenium - web scraping multiple urls for same contents but slightly different xpaths

Scraping multiple urls from same website multiple pages

Scraping text data from different domain urls using Python

Sequential scraping from multiple start_urls leading to error in parsing

Python Scrapy - Scraping data from multiple website URLs

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question Scraping multiple single pages from different domains(mostly) with different structure Scraping tables from Multiple URLs Scraping different variables from multiple URLs into one single CSV file using Python Looping multiple URLs <python scraping issue> ( 2 different URL's from same website) Selenium scraping with multiple urls Selenium - web scraping multiple urls for same contents but slightly different xpaths Scraping multiple urls from same website multiple pages Scraping text data from different domain urls using Python Sequential scraping from multiple start_urls leading to error in parsing Python Scrapy - Scraping data from multiple website URLs

Related Tags

Scraping Information from multiple URLS that are different in structure

Question

1 answers

solution1 0 2021-05-11 09:45:14

solution1
0 2021-05-11 09:45:14