Parsing inner HTML tags with jSoup

Question

I want to find the important links on a website using the Jsoup library. Suppose we have the following code:

<h1><a href="http://example.com">This is important </a></h1>

Now, while parsing, how can we tell that the a tag is inside the h1 tag?

Answer 1

You can do it like this:

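// Needs java.io.File, org.jsoup.Jsoup, org.jsoup.nodes.Document,
// org.jsoup.nodes.Element and org.jsoup.select.Elements.
// Jsoup.parse(File, ...) throws IOException, so call this from a method that handles it.
File input = new File("/tmp/input.html");
Document doc = Jsoup.parse(input, "UTF-8", "http://example.com/");

// Collect every <h1>, then look for <a> elements nested inside each one.
Elements headlinesCat1 = doc.getElementsByTag("h1");
for (Element headline : headlinesCat1) {
    Elements importantLinks = headline.getElementsByTag("a");
    for (Element link : importantLinks) {
        String linkHref = link.attr("href");
        String linkText = link.text();
        System.out.println(linkHref + " (" + linkText + ")");
    }
}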

Taken from the JSoup Cookbook.
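
If you are already iterating over the links and want to know whether a given one sits inside an h1, you can also walk up its ancestors. This is only a minimal sketch: the class name HeadingLinkCheck, the helper isInsideH1, and the second link to http://example.org are made up for illustration and are not part of the question.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

public class HeadingLinkCheck {
    // Hypothetical helper: returns true if the element has an <h1> among its ancestors.
    static boolean isInsideH1(Element element) {
        for (Element ancestor = element.parent(); ancestor != null; ancestor = ancestor.parent()) {
            if (ancestor.tagName().equals("h1")) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        // Sample markup: the first link is from the question, the second is an assumed counterexample.
        String html = "<h1><a href=\"http://example.com\">This is important </a></h1>"
                + "<p><a href=\"http://example.org\">Not in a heading</a></p>";
        Document doc = Jsoup.parse(html);

        // Visit every link that has an href and report whether it is nested in an <h1>.
        for (Element link : doc.select("a[href]")) {
            System.out.println(link.attr("href") + " inside h1: " + isInsideH1(link));
        }
    }
}
```

Walking the parents costs a little more per link than starting from the h1 elements, but it fits the case where the links are the thing you are looping over anyway.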

Answer 2

Use a selector:

Elements elements = doc.select("h1 > a");
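
Note that "h1 > a" matches only links that are direct children of an h1, while "h1 a" matches links at any depth inside it. A small self-contained sketch of the difference follows; the class name HeadingLinks, the helper hrefs, and the second h1 with the http://example.org link are assumptions added for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

public class HeadingLinks {
    // Hypothetical helper: parses the HTML and returns the href of every element matching cssQuery.
    static List<String> hrefs(String html, String cssQuery) {
        Document doc = Jsoup.parse(html);
        List<String> result = new ArrayList<>();
        for (Element link : doc.select(cssQuery)) {
            result.add(link.attr("href"));
        }
        return result;
    }

    public static void main(String[] args) {
        // First heading is from the question; the second (link wrapped in a span) is assumed.
        String html = "<h1><a href=\"http://example.com\">This is important </a></h1>"
                + "<h1><span><a href=\"http://example.org\">Nested deeper</a></span></h1>";

        // Child combinator: only links directly under an <h1>.
        System.out.println(hrefs(html, "h1 > a")); // prints [http://example.com]

        // Descendant combinator: links anywhere inside an <h1>.
        System.out.println(hrefs(html, "h1 a"));   // prints [http://example.com, http://example.org]
    }
}
```

If headings on the target site wrap their links in extra elements, the descendant form "h1 a" is probably the safer choice.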
