简体繁体中英

Cannot scrape with beautifulsoup and urllib because of javascript variable

原文 2013-05-19 18:12:46 6 1 javascript/ beautifulsoup/ urllib2

Unfortunately I am newbie with beautifulsoup and urllib so I might not even ask correctly what I need.. There is a website www.example.com I need to extract some data from this website which displays a random message.

The problem is the message is displayed after the user presses a button, otherwise it shows a general message like "press the button to see the message".

After searching stackoverflow I realised that probably there is NO way to change the variables by calling with my browser the url like this.. www.example.com/?showRandomMsg='true'

In some threads I read that maybe I can do it with bookmarlets..

Is there anyway to use bookmarklets with beautifulsoup or urllib in order to access the website and make it display a random message?

Thanks in advance! :D

1 answers

I came back after a long time just to answer quickly my own question..

I found many solutions and tutorials on the web and most of them were suggesting using Selenium and xpath but this method was more complex than I needed..

So I ended up using Selenium ONLY for emulating the Browser (firefox in my case) and grabbing the html after the page was loaded completely.

After that I was still using beautifoulsoup to parse the html code (whihc now would include the javascript data too).

BeautifulSoup scrape from javascript (encoded) variable

Scrape a javascript variable from a webpage

How do I scrape data generated with javascript using BeautifulSoup?

Scrape javascript table with beautifulSoup which loads table data everytime on click

scrape span using BeautifulSoup

Missing html in beautifulsoup scrape

ruby nokogiri restclient to scrape javascript variable

Finding JavaScript variable with certain string with BeautifulSoup

Unable to scrape few details in BeautifulSoup

Trying to scrape iframe using beautifulsoup

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question BeautifulSoup scrape from javascript (encoded) variable Scrape a javascript variable from a webpage How do I scrape data generated with javascript using BeautifulSoup? Scrape javascript table with beautifulSoup which loads table data everytime on click scrape span using BeautifulSoup Missing html in beautifulsoup scrape ruby nokogiri restclient to scrape javascript variable Finding JavaScript variable with certain string with BeautifulSoup Unable to scrape few details in BeautifulSoup Trying to scrape iframe using beautifulsoup

Related Tags

Cannot scrape with beautifulsoup and urllib because of javascript variable

Question

1 answers

solution1 1 ACCPTED 2014-05-31 21:33:31

solution1
1 ACCPTED 2014-05-31 21:33:31