简体繁体中英

How to get the main content of an article from HTML using boilerplate?

原文 2016-10-10 06:53:30 0 1 java/ summarization/ boilerpipe

I am trying to get the main content of an article from an HTML using boilerpipe code.

Downloaded the latest jars from here .

I am trying to use the following code:

String article = "";
try {
    article = ArticleExtractor.INSTANCE.getText(url);   
    System.out.println("Article ++++ >>" + article);    
} catch (BoilerpipeProcessingException e) {
    // TODO Auto-generated catch block
    e.printStackTrace();
}

But this returns an empty string for every URL . Can anyone help me on this?

1 answers

Have you tried to pass the HTML itself instead of the url? Or maybe there is a problem with the way your url strings are formatted.

HOW to get article content from many urls webpages

How to extract published-time and article-content from a news article using java?

android get news article content

How to get HTML content using class name?

How to get the HTML content from HttpServletResponse?

How to get HTML Content from WebView for print?

How to read/parse article content from link to string

Extract article's headline from HTML(using Boilerpipe)

Get the main content from RSS Feeds

How can I get HTML content from a specific URL on server side by using Java?

暂无

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question HOW to get article content from many urls webpages How to extract published-time and article-content from a news article using java? android get news article content How to get HTML content using class name? How to get the HTML content from HttpServletResponse? How to get HTML Content from WebView for print? How to read/parse article content from link to string Extract article's headline from HTML(using Boilerpipe) Get the main content from RSS Feeds How can I get HTML content from a specific URL on server side by using Java?

Related Tags

粤ICP备18138465号 © 2020-2024 STACKOOM.COM