JSOUP链接提取的标题？

Question

im using Java and Jsoup and the portion of HTML I'm trying to utilize is 我正在使用Java和Jsoup，而我尝试利用的HTML部分是

<i class="fa fa-star"></i> <a href="#taskruns" data-toggle="tab">396900 runs submitted</a>

I just need to extract the title "396900 runs" 我只需要提取标题“ 396900次运行”

How would I go about doing this? 我将如何去做呢？ I'm fairly new to parsing and web scraping 我对解析和网络抓取还很陌生

Answer 1

This is how can you exact text from html. 这是您如何从html提取文本。

import java.io.IOException;  
import org.jsoup.Jsoup;  
import org.jsoup.nodes.Document;  
import org.jsoup.nodes.Element;

public class WebScraping{  
    public static void main( String[] args ) throws IOException{  
            String html = "<i class='fa fa-star'></i> <a href='#taskruns' data-toggle='tab'>396900 runs submitted</a>";


            Document doc = Jsoup.parse(html); //First you have to parse html 
            Element link = doc.select("a").first(); //Then find the css selector from which you want to extract data

            String linkText = link.text(); //Then extract the text from selector

            System.out.println(linkText);
    }  
}

You can learn more from here . 您可以从这里了解更多信息。

JSOUP链接提取的标题？

问题描述

1 个解决方案

解决方案1
0 已采纳 2018-04-08 07:20:45

JSOUP链接提取的标题？

问题描述

1 个解决方案

解决方案1 0 已采纳 2018-04-08 07:20:45

解决方案1
0 已采纳 2018-04-08 07:20:45