简体繁体 English

需要通过Mechanize + BeautifulSoup（Python）启用Javascript的抓取网站

[英]Scraping site that requires Javascript enabled with Mechanize + BeautifulSoup (Python)

原文 2010-11-06 17:24:37 5 2 javascript/ python/ screen-scraping/ mechanize

So.. i got this site I am tryign to scrape, but as I understand lack of support of mechanize for .js, and a stuborn site that requires javascript enabled browser is not a good mix... 所以..我很想刮这个网站，但是据我了解，缺乏对.js机械化的支持，而一个需要启用JavaScript的浏览器的繁华网站却不是很好的组合...

I am looking for ideas, on how to do this... 我正在寻找有关如何执行此操作的想法...

URL : https://members.iracing.com/membersite/login.jsp 网址： https : //members.iracing.com/membersite/login.jsp

2 个解决方案

Depending on what you need to do, you could use webkit to parse the page, which will allow you to get the final html after the javascript has been executed. 根据您需要执行的操作，可以使用webkit来解析页面，这将使您可以在执行javascript之后获取最终的html。 You could then use any decent html parser, beautifulsoup for example, to do the rest. 然后，您可以使用任何不错的html解析器（例如beautifulsoup）来完成其余的工作。

使用JavaScript，我将Chickenfoot用于简单的网站，将Webkit用于更复杂的网站。

本网站需要在您的浏览器中启用 Javascript - This site requires Javascript enabled in your browser

PHP 错误此站点需要启用 JavaScript - PHP Error this site requires JavaScript enabled

使用 BeautifulSoup 抓取 JavaScript (ReactTable) - Scraping JavaScript (ReactTable) with BeautifulSoup

使用Selenium和BeautifulSoup搜寻网站 - Scraping a site using Selenium and BeautifulSoup

在启用 Javascript 的情况下抓取网站？ - Scraping websites with Javascript enabled?

当页面需要启用JavaScript时，Python获取URL内容 - Python get URL contents when page requires JavaScript enabled

刮取需要使用Python登录的Javascript呈现页面 - Scraping Javascript-rendered page that requires login using Python

使用BeautifulSoup刮取包含JavaScript的网页 - Scraping a webpage that has JavaScript with BeautifulSoup

在python机械化的javascript中提交请求 - to submit request in python mechanize for javascript

Python /使用CSRF在Javascript页面上机械化 - Python/mechanize on javascript page with csrf

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 本网站需要在您的浏览器中启用 Javascript - This site requires Javascript enabled in your browser PHP 错误此站点需要启用 JavaScript - PHP Error this site requires JavaScript enabled 使用 BeautifulSoup 抓取 JavaScript (ReactTable) - Scraping JavaScript (ReactTable) with BeautifulSoup 使用Selenium和BeautifulSoup搜寻网站 - Scraping a site using Selenium and BeautifulSoup 在启用 Javascript 的情况下抓取网站？ - Scraping websites with Javascript enabled? 当页面需要启用JavaScript时，Python获取URL内容 - Python get URL contents when page requires JavaScript enabled 刮取需要使用Python登录的Javascript呈现页面 - Scraping Javascript-rendered page that requires login using Python 使用BeautifulSoup刮取包含JavaScript的网页 - Scraping a webpage that has JavaScript with BeautifulSoup 在python机械化的javascript中提交请求 - to submit request in python mechanize for javascript Python /使用CSRF在Javascript页面上机械化 - Python/mechanize on javascript page with csrf

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM