简体繁体中英

How to loop through each page of website for web scraping with BeautifulSoup

原文 2017-09-20 23:04:25 6 1 python/ html/ web-scraping/ beautifulsoup

I am scraping job posting data from a website using BeautifulSoup. I have working code that does what I need, but it only scrapes the first page of job postings. I am having trouble figuring out how to iteratively update the url to scrape each page. I am new to Python and have looked at a few different solutions to similar questions, but have not figured out how to apply them to my particular url. I think I need to iteratively update the url or somehow click the next button and then loop my existing code through each page. I appreciate any solutions.

url: https://jobs.utcaerospacesystems.com/search-jobs

1 answers

First, BeautifulSoup doesn't have anything to do with GETing web pages - you get the webpage yourself, then feed it to bs4 for processing.

The problem with the page you linked is that it's javascript - it only renders correctly in a browser (or any other javascript VM).

@Fabricator is on the right track - you'll need to watch the developer console and see what the ajax requests the js is sending to the server. In this case, also take a look at the query string params, which include a param called CurrentPage - that's probably the one you want to focus on.

How to loop through a list of urls for web scraping with BeautifulSoup

Python web scraping using BeautifulSoup, how to loop through complicated URL?

How to loop through website with BeautifulSoup?

How to loop through a nested web page for web scraping?

Web scraping through multiple pages doesnt save each result -beautifulsoup

How to loop through scraping multiple documents on multiple web pages using BeautifulSoup?

Web Scraping through Python BeautifulSoup

Web scraping through pagination with BeautifulSoup

how to web scraping - beautifulSoup

How to iterate through a list of URLs for BeautifulSoup Web Scraping?

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question How to loop through a list of urls for web scraping with BeautifulSoup Python web scraping using BeautifulSoup, how to loop through complicated URL? How to loop through website with BeautifulSoup? How to loop through a nested web page for web scraping? Web scraping through multiple pages doesnt save each result -beautifulsoup How to loop through scraping multiple documents on multiple web pages using BeautifulSoup? Web Scraping through Python BeautifulSoup Web scraping through pagination with BeautifulSoup how to web scraping - beautifulSoup How to iterate through a list of URLs for BeautifulSoup Web Scraping?

Related Tags

How to loop through each page of website for web scraping with BeautifulSoup

Question

1 answers

solution1 0 ACCPTED 2017-09-20 23:15:52

solution1
0 ACCPTED 2017-09-20 23:15:52