jsoup：解析特定標記之后的某個標記的數據

Question

我試圖通過Java中的jsoup從過去3天解析某些信息-_-，這是我的代碼：

Document document = Jsoup.connect(urlofpage).get();
Elements links = document.select(".contentBox");

    for (Element link : links) {
        // String name = link.text();
        String title = link.select("h2").text();
        String content = link.select("p").text();
        System.out.println(title);
        System.out.println(content);
    }

它是按照指示獲取數據，取出h2和p的數據分開，但問題是，我想解析<p>標簽內的數據，這是在每個<h2>標簽之后。

例如（HTML內容）：

<h2>main content</h2>
<div class="acx"><div>
<p>content</p>
<p>content 2</p>

<h2>content 2</h2>
<div class="acx"><div>
<p>new content od 2</p>
<p>new 2</p>

現在它應該像（在數組中）獲取：

array[0] = "content content 2",
array[1] = "new content od 2 new 2",

有解決方案嗎

Answer 1

您可以使用“〜”下一個元素選擇器。 例如

link.select("h2 ~ p").get(0).text(); // returns "content"
link.select("h2 ~ p").get(1).text(); // returns "new content od 2"

Answer 2

只需使用您的初始方法迭代所選.contentBox類中的所有必要標記：

Document document = Jsoup.connect(urlofpage).get();
Elements links = document.select(".contentBox");

       for (Element link : links) {
            for (Element h2Tag : link.select("h2"))
            {
               System.out.println(h2Tag.text());
            }
            for (Element pTag : link.select("p"))
            {
               System.out.println(pTag.text());
            }
         }

jsoup：解析特定標記之后的某個標記的數據

問題描述

2 個解決方案

解決方案1
0 2017-05-03 19:18:49

解決方案2
0 2017-05-03 19:28:30

jsoup：解析特定標記之后的某個標記的數據

問題描述

2 個解決方案

解決方案1 0 2017-05-03 19:18:49

解決方案2 0 2017-05-03 19:28:30

解決方案1
0 2017-05-03 19:18:49

解決方案2
0 2017-05-03 19:28:30