[英]Jsoup split block element to preserve order of text and child elements
Lets say I have an html fragment like this:假设我有一个像这样的 html 片段:
<p>Generally speaking, in the U.S., if you want to <a href="\"https://web.archive.org/web/20210408195204/https://www.wsj.com/articles/investors-big-and-small-are-driving-stock-gains-with-borrowed-money-11617799940\"">
borrow money from your broker to buy stocks</a>, you are capped at 2-to-1 leverage. If you have $100, you can buy $200 worth of stock. Back in the olden days, you could have bought $300 or $500 or $1,000 of stock with your $100, borrowing the rest from your broker, but then a Great Depression happened and regulators clamped down on margin lending. </p>
I've parsed it with jsoup and its represented as an Element .我已经用jsoup对其进行了解析,并将其表示为Element 。
I'd like to be able to split the element into something like:我希望能够将元素拆分为:
"Generally speaking, in the US, if you want to "
"Generally speaking, in the US, if you want to "
<a href="\"https://web.archive.org/web/20210408195204/https://www.wsj.com/articles/investors-big-and-small-are-driving-stock-gains-with-borrowed-money-11617799940\"">
<a href="\"https://web.archive.org/web/20210408195204/https://www.wsj.com/articles/investors-big-and-small-are-driving-stock-gains-with-borrowed-money-11617799940\"">
"borrow money from your broker to buy stocks</a>, you are capped at 2-to-1 leverage. If you have $100, you can buy $200 worth of stock. Back in the olden days, you could have bought $300 or $500 or $1,000 of stock with your $100, borrowing the rest from your broker, but then a Great Depression happened and regulators clamped down on margin lending"
"borrow money from your broker to buy stocks</a>, you are capped at 2-to-1 leverage. If you have $100, you can buy $200 worth of stock. Back in the olden days, you could have bought $300 or $500 or $1,000 of stock with your $100, borrowing the rest from your broker, but then a Great Depression happened and regulators clamped down on margin lending"
And do this while preserving the ordering of those pieces.并在保留这些部分的顺序的同时做到这一点。
Things I've looked at so far:到目前为止我看过的东西:
What you want is childNodes()
method.你想要的是
childNodes()
方法。 Then you get a List<Node>
.然后你得到一个
List<Node>
。 You can work with these nodes, access their attributes and parents, but you lose the possibility to use select()
to access deeper tags because only Element
class has this method.您可以使用这些节点,访问它们的属性和父节点,但是您无法使用
select()
访问更深的标签,因为只有Element
class 具有此方法。
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.