
How to get just the content of a post from a blog?

I have only the URL of a post, like http://www.avc.com/a_vc/2011/08/html5-continued.html. Is there any way to get just the content of this post? I mean, excluding menus, logos and advertisements.

Thank you very much!

If you want to scrape the site, first consider whether this is legal.

Then, you can do that by getting the innerHTML (or, with jQuery, the .html()) of the appropriate element. In your case this is disqus_post_message.
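Since the question is tagged Java, here is a rough server-side sketch of the same idea using only the JDK: find the element with a given id and return its inner HTML. The class and method names are made up for illustration, and it assumes the target is identified by an id attribute and that the markup is reasonably well-formed (no self-closing tags of the same name, no matching tags inside comments or scripts). A real HTML parser is more robust; this only shows the principle.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class InnerHtmlSketch {

    /**
     * Returns the inner HTML of the first element whose id attribute equals
     * the given id, or null if no such element (or no matching close) is found.
     */
    static String innerHtmlById(String html, String id) {
        // Locate the opening tag that carries id="..." (single or double quotes).
        Pattern open = Pattern.compile(
                "<([a-zA-Z][a-zA-Z0-9]*)\\b[^>]*\\sid\\s*=\\s*[\"']"
                        + Pattern.quote(id) + "[\"'][^>]*>");
        Matcher m = open.matcher(html);
        if (!m.find()) {
            return null;
        }
        String tag = m.group(1);
        int start = m.end();

        // Walk the following tags with the same name and track nesting depth,
        // so that nested <div>s inside the target do not end the match early.
        Pattern same = Pattern.compile("<(/?)" + tag + "\\b[^>]*>",
                Pattern.CASE_INSENSITIVE);
        Matcher t = same.matcher(html);
        int depth = 1;
        int pos = start;
        while (t.find(pos)) {
            if (t.group(1).isEmpty()) {
                depth++;          // another opening tag of the same name
            } else {
                depth--;          // a closing tag
            }
            if (depth == 0) {
                return html.substring(start, t.start());
            }
            pos = t.end();
        }
        return null;              // unbalanced markup
    }

    public static void main(String[] args) {
        // "post-body" is a hypothetical id used only for this demo.
        String page = "<div id=\"menu\">Menu</div>"
                + "<div id=\"post-body\"><p>Post text</p></div>";
        System.out.println(innerHtmlById(page, "post-body"));
    }
}
```

You would download the page first and pass its HTML in as a string. Check the page source to see which attribute actually carries the name you are after.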

As @bensiu noted it would be easier to use the RSS feed.
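To illustrate the RSS route, here is a minimal sketch using the JDK's built-in XML parser. It assumes the blog publishes an RSS 2.0 feed in which each item has a link and a description holding the post body; neither the feed URL nor that layout is confirmed for this particular blog, and the class and method names are hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

class RssPostSketch {

    /**
     * Returns the description of the feed item whose link equals postUrl,
     * or null if the feed has no such item.
     */
    static String contentForLink(String rssXml, String postUrl) {
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            // Refuse DOCTYPE declarations, since the feed is untrusted input.
            factory.setFeature(
                    "http://apache.org/xml/features/disallow-doctype-decl", true);
            DocumentBuilder builder = factory.newDocumentBuilder();
            Document doc = builder.parse(new ByteArrayInputStream(
                    rssXml.getBytes(StandardCharsets.UTF_8)));

            NodeList items = doc.getElementsByTagName("item");
            for (int i = 0; i < items.getLength(); i++) {
                Element item = (Element) items.item(i);
                if (postUrl.equals(childText(item, "link"))) {
                    return childText(item, "description");
                }
            }
            return null;
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse feed", e);
        }
    }

    /** Trimmed text of the first descendant with the given tag name, or null. */
    private static String childText(Element parent, String tag) {
        NodeList nodes = parent.getElementsByTagName(tag);
        return nodes.getLength() == 0
                ? null
                : nodes.item(0).getTextContent().trim();
    }
}
```

The feed already separates each post from the menus, logos and ads, so there is nothing to strip. Some feeds only carry a summary in description, so check what yours contains.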

Since you tagged Java, here are the libraries that can be useful:

