Getting elements without attributes using Jsoup

Question

I have the following HTML. Using Jsoup, I'm trying to extract the text of the p element that has no attributes (the text "Some text 2", not "Some text 1").

<div id="intro">
    <h1 class="some class"></h1>
    <p id="some_id">
        Some text 1
    </p>
    <p>
        Some text 2
    </p>
</div>

I tried the following Jsoup selector:

div[id=intro] > p:not(:has(@*))

But it doesn't work. Thanks for your help.

Answer 1

I think you can use the Jsoup CSS selector p:not([^]). The [^attrPrefix] form matches elements that have an attribute whose name starts with the given prefix; with an empty prefix it matches any attribute, so p:not([^]) selects every p that has no attributes at all.

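import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;
import org.jsoup.select.Elements;

public class NoAttributeSelector {
    public static void main(String[] args) {
        String html = "<div id=\"intro\">"
                + "<h1 class=\"some class\"></h1>"
                + "<p id=\"some_id\">"
                +   "Some text 1"
                + "</p>"
                + "<p name=\"some_name\">"
                +   "Some text A"
                + "</p>"
                + "<p data>"
                +   "Some text B"
                + "</p>"
                + "<p>"
                +   "Some text 2"
                + "</p>"
                + "</div>";

        Document doc = Jsoup.parse(html);
        // [^] is an attribute-name-prefix match with an empty prefix (any attribute);
        // :not(...) inverts it, leaving only the p elements without attributes.
        Elements els = doc.select("p:not([^])");
        for (Element el : els) {
            System.out.println(el.text());
        }
    }
}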

The above example prints only:

Some text 2

because it is the only p element that has no attributes.

Note that the selector p[^] selects all p elements that do have at least one attribute.
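
If you would rather not rely on the empty-prefix form of the selector, the same filtering can be done in plain Java after a simpler select. The following is only a minimal sketch, not part of the original answer: the class name NoAttributeFilter and the method textOfBareParagraphs are hypothetical names chosen for illustration, it assumes Jsoup is on the classpath, and it assumes the h1 is closed so that the p elements are direct children of the div.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

import java.util.ArrayList;
import java.util.List;

// Hypothetical helper class, named for illustration only.
public class NoAttributeFilter {

    // Returns the text of every <p> that is a direct child of div#intro
    // and carries no attributes at all.
    static List<String> textOfBareParagraphs(String html) {
        Document doc = Jsoup.parse(html);
        List<String> result = new ArrayList<>();
        for (Element p : doc.select("div#intro > p")) {
            // Attributes.size() counts every attribute, including
            // value-less ones such as <p data>.
            if (p.attributes().size() == 0) {
                result.add(p.text());
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Assumed sample markup, adapted from the question with the h1 closed.
        String html = "<div id=\"intro\">"
                + "<h1 class=\"some class\"></h1>"
                + "<p id=\"some_id\">Some text 1</p>"
                + "<p data>Some text B</p>"
                + "<p>Some text 2</p>"
                + "</div>";
        System.out.println(textOfBareParagraphs(html));
    }
}
```

The selector version is shorter; the loop version makes the "no attributes" condition explicit and leaves room for extra conditions in the same place.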

Answer 1 (4 · accepted · 2015-09-17 10:19:05)
