Intelligent web scraping c#

Original 2012-10-17 11:33:35 1 1 c#/ html/ web-scraping

There are a number of products out there that provide a GUI to pick out the tags you want to scrape from a web page (WebHarvy, for example).

I've seen the HTML Agility Pack before for getting at the DOM. I just wanted to check whether anyone knows of any nice libraries or processes for automatically finding the useful content within an HTML page and creating the XPath required.

Something similar to how Evernote and iOS know where the "Article" is on a page, but ideally also working for repeating regions and pagination.

1 answer

Not sure if this is what you are looking for:
http://www.diffbot.com/

Diffbot is good at scraping content from websites.

Web Scraping with c# and HTMLAgilityPack

Efficient web scraping with C#?

Web Scraping using C#

selenium web scraping in c#

No results with C# web scraping

C# web scraping Javascript

Web Scraping with Captcha in C# using AngleSharp

Extract data from Web Scraping C#

Scraping dynamic web content in C#

Web page(html) scraping using C#

The technical posts on this site are licensed under CC BY-SA 4.0. If you reprint this, please cite the site URL or the original address. For any questions, please contact yoyou2525@163.com.

solution1 0 2012-10-17 12:00:02