简体繁体中英

Scraping a website that URL doesn't change when clicking on "next page" button

原文 2021-10-14 00:02:42 0 1 python/ selenium/ web-scraping

I'm trying to scrape a BBC website

https://www.bbc.com/news/topics/c95yz8vxvy8t/hong-kong-anti-government-protests

and I would like to get all the news articles. But the URL doesn't change when clicking on the next page button so I can only get the first page information. Can anyone help? I'm using Selenium but familiar with requests too. Thanks!

1 answers

use developer console in your browser, go to networks tab, disable cache. you can see api requests being made for each page change. you dont need selenium, you can just use requests or aiohttp.

this is an example: https://push.api.bbci.co.uk/batch?t=%2Fdata%2Fbbc-morph-lx-commentary-data-paged%2Fabout%2Fd5803bfc-472d-4abf-b334-d3fc4aa8ebf9%2FisUk%2Ffalse%2Flimit%2F20%2FnitroKey%2Flx-nitro%2FpageNumber%2F2%2Fversion%2F1.5.6?timeout=5

type "batch" in the filter bar and you should see only the api calls I believe to be responsible for page change.

you can get the about id(d5803bfc-472d-4abf-b334-d3fc4aa8ebf9) of this topic in the webpage source. in this case in, https://www.bbc.com/news/topics/c95yz8vxvy8t/hong-kong-anti-government-protests

Scraping a website which has a table but the next button on the table doesn't change the url

Can't go on clicking on the next page button while scraping certain fields from a website

Scraping data from a website that URL does not change when clicking on a particular onclick button

url doesn't change when moving to the next page

Scraper doesn't stop clicking on the next page button

Scraping through multiple pages when url doesn't change

Scraping data from a site where URL doesn't change on clicking 'Show More'

Crawl data from next page doesn't change URL

Can't scrape titles from a website while clicking on the next page button

Scraping a website for multiple pages that contains _dopostback method and the URL doesn't change for the pages

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question Scraping a website which has a table but the next button on the table doesn't change the url Can't go on clicking on the next page button while scraping certain fields from a website Scraping data from a website that URL does not change when clicking on a particular onclick button url doesn't change when moving to the next page Scraper doesn't stop clicking on the next page button Scraping through multiple pages when url doesn't change Scraping data from a site where URL doesn't change on clicking 'Show More' Crawl data from next page doesn't change URL Can't scrape titles from a website while clicking on the next page button Scraping a website for multiple pages that contains _dopostback method and the URL doesn't change for the pages

Related Tags

Scraping a website that URL doesn't change when clicking on "next page" button

Question

1 answers

solution1 1 2021-10-14 01:54:10

solution1
1 2021-10-14 01:54:10