HTML content after a particular tag using Jsoup

Question

I have a String with HTML-formatted text (not a whole webpage).

How can I get all the HTML content after a particular tag using Jsoup?

To be more concrete, assume I have the following string:

String input = "<div>a</div><p>b</p><strong>c</strong>";

I would like to get:

String output = "<p>b</p><strong>c</strong>";

So far I am doing:

Document doc = Jsoup.parseBodyFragment(input); // parse the fragment into a document body
Element p = doc.select("p").first(); // select() returns Elements; first() yields the single p

I am having a hard time figuring out how to output p together with everything that comes after it. For simplicity, assume that p is unique.

Another input/output pair (as requested):

String input = "<br /><strong>a</strong><strong>b</strong><p>c</p><div>d</div><br />";
String output = "<p>c</p><div>d</div><br />";

Thank you in advance.

Answer 1

Here's some code, hope it helps you a bit:

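import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;
import org.jsoup.select.Elements;

String input = "<div>a</div><p>b</p><strong>c</strong>";

Document doc = Jsoup.parse(input);
// "p ~ *" matches every element sibling that follows a <p>, but not the <p> itself
Elements elements = doc.select("p ~ *");

Elements group = new Elements();
// Put the <p> back in front: it is the previous sibling of the first match.
// Note: first() returns null when nothing follows the <p>, so guard for that case if it can occur.
group.add(elements.first().previousElementSibling());
group.addAll(elements);

// You can work with 'group' too
String output = group.toString();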

Output:

Example 1:

<p>b</p>
<strong>c</strong>

Example 2:

<p>c</p>
<div>
 d
</div>
<br />
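
Note that the outputs above are re-serialized by Jsoup (the div in example 2 comes back indented), so they are not character-for-character the strings asked for in the question. If the original markup has to be preserved exactly, a plain-string cut may be enough under the question's assumption that p is unique. The following is only a sketch and not a Jsoup API: the class FromTag and the method fromTag are hypothetical names, and it assumes the text "<p" never appears inside comments, scripts, or attribute values.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class FromTag {
    // Hypothetical helper (not part of Jsoup): returns the input from the first
    // opening <tag ...> onward, or "" if that tag never appears.
    static String fromTag(String html, String tag) {
        // Require whitespace, "/" or ">" right after the name so that "p"
        // does not match "<pre>" or "<param>".
        Pattern open = Pattern.compile(
                "<" + Pattern.quote(tag) + "(?=[\\s/>])",
                Pattern.CASE_INSENSITIVE);
        Matcher m = open.matcher(html);
        return m.find() ? html.substring(m.start()) : "";
    }

    public static void main(String[] args) {
        // The two inputs from the question
        System.out.println(fromTag("<div>a</div><p>b</p><strong>c</strong>", "p"));
        System.out.println(fromTag(
                "<br /><strong>a</strong><strong>b</strong><p>c</p><div>d</div><br />", "p"));
    }
}
```

For anything less regular than these fragments (nested markup, several p elements, malformed HTML), the parser-based approach above is the safer choice.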

Answer 1 (4, accepted, 2012-09-13 12:26:04)
