
HTML Scraping with Javascript

Original 2013-08-23 21:15:39 5 1 javascript/ html/ screen-scraping

I use a simple JavaScript script, run from a batch file, to download audio and video (radio and TV shows) from the BBC iPlayer.

Part of the script extracts data from the BBC's XML pages.

I now want to try extracting data from an HTML page. Can anyone point me to a JavaScript method for extracting data from an ordinary .htm or .html page?

I want to keep things simple by having a JavaScript routine that I can include in an HTML page on my website, so I'm only interested in JavaScript solutions. Thanks.

Edit, 24 Aug -

The BBC's HTML pages don't respond to the JavaScript scripts which successfully parse their XML pages.

I use a simple JavaScript routine to interrogate the XML, based on this:

    function loadXML() {
        xmlDoc = new ActiveXObject("Microsoft.XMLDOM");
        xmlDoc.async = false;
        xmlDoc.onreadystatechange = readXML;
        xmlDoc.load(url);
    }

1 answer

Your question is kinda vague. I think there may be two ways to get this done:

1. Apply a RegExp to the HTML text to match the patterns you need.
2. Import the HTML into a DOM simulator and walk the tree to find the data (I assume you are using Node.js).
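As a rough illustration of the first option, here is a minimal sketch that pulls the page title and link targets out of an HTML string with regular expressions. The sample markup and the helper names `extractTitle` and `extractLinks` are made up for this example, and the page text is assumed to have been downloaded already. Regular expressions are fragile on real-world HTML (unquoted attributes, comments, nested tags), so treat this as a starting point rather than a robust parser.

```javascript
// Hypothetical sample markup; a real page would be downloaded first.
var sampleHtml =
    '<html><head><title> Example Programme </title></head>' +
    '<body>' +
    '<a href="/episode/one">Episode 1</a>' +
    '<a class="more" href="/episode/two">Episode 2</a>' +
    '</body></html>';

// Return the text of the first <title> element, or null if none is found.
function extractTitle(html) {
    var match = /<title[^>]*>([\s\S]*?)<\/title>/i.exec(html);
    if (!match) {
        return null;
    }
    // Strip leading and trailing whitespace without relying on String.trim.
    return match[1].replace(/^\s+|\s+$/g, "");
}

// Return every double-quoted href value that appears on an <a> tag.
function extractLinks(html) {
    var pattern = /<a\b[^>]*\bhref="([^"]*)"/gi;
    var links = [];
    var match;
    // With the g flag, exec resumes after the previous match on each call.
    while ((match = pattern.exec(html)) !== null) {
        links.push(match[1]);
    }
    return links;
}

var pageTitle = extractTitle(sampleHtml);
var pageLinks = extractLinks(sampleHtml);
```

The code sticks to older JavaScript syntax (`var`, no template strings) so that it has a chance of running in a script host as well as in a browser or Node.js. The second option, walking a DOM tree, is usually more reliable but needs a DOM implementation to be available in whatever environment runs the script.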





solution1 0 2013-08-23 21:22:44
