
How to scrape a page generated with a script in C#?

Original post 2011-04-20 15:59:02 3 1 c#/ web-scraping

Simple example: Google search page.

http://www.google.com/search?q=foobar

When I get the source of the page, I get the underlying JavaScript. I want the resulting page. What do I do?

1 answer

Even though the response looks as if it is only JavaScript, it really is the full HTML. You can easily confirm this with HtmlAgilityPack:

HtmlAgilityPack.HtmlWeb web = new HtmlAgilityPack.HtmlWeb();
HtmlAgilityPack.HtmlDocument doc = web.Load("http://www.google.com/search?q=foobar");
string html = doc.DocumentNode.OuterHtml;
var nodes = doc.DocumentNode.SelectNodes("//div"); //returns 85 nodes
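
One detail worth guarding against when repeating this check on other pages: `SelectNodes` returns `null` rather than an empty collection when the XPath matches nothing. Below is a minimal sketch of a null-safe count, assuming the HtmlAgilityPack NuGet package is referenced. `DivCounter`, `CountDivs`, and the sample markup are hypothetical names made up for illustration, and the HTML is parsed from a string so the example does not depend on what Google returns today.

```csharp
using System;
using HtmlAgilityPack;

public static class DivCounter
{
    // Hypothetical helper: counts <div> elements in an HTML string.
    public static int CountDivs(string html)
    {
        var doc = new HtmlDocument();
        doc.LoadHtml(html);

        // SelectNodes returns null (not an empty collection) when nothing matches.
        HtmlNodeCollection nodes = doc.DocumentNode.SelectNodes("//div");
        return nodes == null ? 0 : nodes.Count;
    }

    public static void Main()
    {
        // Made-up sample: real markup alongside an inline script.
        const string sample =
            "<html><body><div>a</div><div><div>b</div></div>" +
            "<script>var x = 1;</script></body></html>";

        Console.WriteLine(CountDivs(sample));                                      // 3
        Console.WriteLine(CountDivs("<html><body><p>no divs</p></body></html>")); // 0
    }
}
```

If the count for a downloaded page is well above zero, as with the 85 nodes reported above, the markup is present in the response and no script execution is needed to read it.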

Related questions:

How to scrape text from an html page using C#?

Scrape Table from web page in c#

Scrape a javascript-generated website in C# without installing a browser

How to evaluate Javascript within C#? (need to get all links for a web page, including java-script generated ones)

How retrieve by c# page html generated with AngularJS

How to get raw page source (not generated source) from c#

How do i Screen scrape html page generated by javascript

Get generated script in MongoDB C# driver

Scrape data from web page with HtmlAgilityPack c#

Alternative of PuppeteerSharp and Selenium to scrape web page after login in C#


The technical posts on this site are licensed under CC BY-SA 4.0. If you reprint them, please cite the site URL or the original address. For any questions, please contact: yoyou2525@163.com.

