简体繁体中英

Scraping data from complex website (hidden content)

原文 2018-06-18 15:16:07 7 1 python/ python-3.x/ beautifulsoup

I am just starting with web scraping and unfortunately, I am facing a showstopper: I would like pull some financial data but it seems that the website is quite complex (dynamic content etc.).

Data I would like pull

Website: https://www.de.vanguard/web/cf/professionell/de/produktart/detailansicht/etf/9527/EQUITY/performance

So far, I've used Beautiful Soup to get this done. However, I cannot even find the table. Any ideas?

1 answers

Look into using selenium to launch an automated web browser. This loads the web page and it's associated dynamic content, as well as allow you the option to 'click' on certain web elements to load content that may be generated on_click . You can use this in tandem with BeautifulSoup by passing driver.page_source to BeautifulSoup and parsing through it that way.

This SO answer provides a basic example that would serve as a good starting point: Python WebDriver how to print whole page source (html)

(Python) Scraping data from a website with 'style:hidden' tags?

Scraping data from website

Scraping a website with data hidden under "read more"

Scraping a website with data hidden under “Lihat Selengkapnya”

Scraping data from a complex graph

Scraping hidden content from a javascript webpage with python

Scraping content from website using Beaufifulsoup and Requests

Scraping hidden leaderboard data from site

Scraping excel from website using python with _doPostBack link url hidden

Selenium Web Scraping With Beautiful Soup on Dynamic Content and Hidden Data Table

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question (Python) Scraping data from a website with 'style:hidden' tags? Scraping data from website Scraping a website with data hidden under "read more" Scraping a website with data hidden under “Lihat Selengkapnya” Scraping data from a complex graph Scraping hidden content from a javascript webpage with python Scraping content from website using Beaufifulsoup and Requests Scraping hidden leaderboard data from site Scraping excel from website using python with _doPostBack link url hidden Selenium Web Scraping With Beautiful Soup on Dynamic Content and Hidden Data Table

Related Tags

Scraping data from complex website (hidden content)

Question

1 answers

solution1 0 ACCPTED 2018-06-18 15:20:50

solution1
0 ACCPTED 2018-06-18 15:20:50