簡體   English   中英

從數組中的jsoup獲得文本獨立的標簽

[英]Get the text indie a tag from jsoup in an array

我正在使用jsoup解析html元素

我在做什么

 Elements e = document.select(".doc-type-list li a");
         System.out.println(e);

這給我這個

<a class="doc-type doc-type-mtm" href="/mtm/a-d-topical.html">A &amp; D topical</a>
<a class="doc-type doc-type-cdi" href="/cdi/a-d-cracked-skin-relief-cream.html">A + D Cracked Skin Relief cream</a>
<a class="doc-type doc-type-mtm" href="/mtm/a-200-lice-treatment.html">A-200 Lice Treatment</a>
<a class="doc-type doc-type-mtm" href="/mtm/a-25.html">A-25</a>
<a class="doc-type doc-type-cons" href="/cons/a-caro-25.html">A-Caro-25</a>
<a class="doc-type doc-type-cons" href="/cons/a-g-profen.html">A-G Profen</a>
<a class="doc-type doc-type-pro" href="/pro/a-hydrocort.html">A-Hydrocort</a>
<a class="doc-type doc-type-mtm" href="/mtm/a-methapred-injection.html">A-Methapred injection</a>
<a class="doc-type doc-type-cons" href="/cons/a-methapred.html">A-Methapred</a>
<a class="doc-type doc-type-cdi" href="/cdi/a-methapred-solution.html">A-methapred solution</a>
<a class="doc-type doc-type-pro" href="/pro/a-methapred-injection.html">A-Methapred Injection</a>
<a class="doc-type doc-type-mtm" href="/mtm/a-phedrin.html">A-Phedrin</a>
<a class="doc-type doc-type-cdi" href="/cdi/a-spaz.html">A-Spaz</a>
<a class="doc-type doc-type-cdi" href="/cdi/a-tan-12x-suspension.html">A-Tan 12X suspension</a>
<a class="doc-type doc-type-mtm" href="/mtm/a-e-r-witch-hazel.html">A.E.R. Witch Hazel</a>
<a class="doc-type doc-type-cons" href="/cons/a-b-otic.html">A / B Otic</a>
<a class="doc-type doc-type-mtm" href="/mtm/a-fish-oil.html">A / Fish Oil</a>
<a class="doc-type doc-type-mtm" href="/mtm/a-t-s.html">A / T / S</a>
<a class="doc-type doc-type-cons" href="/cons/a-t-s-topical.html">A / T / S Topical</a>
<a class="doc-type doc-type-monograph" href="/monograph/a1-proteinase-inhibitor-human.html">a1-Proteinase Inhibitor (Human)</a>
<a class="doc-type doc-type-cons" href="/cons/a200-maximum-strength-topical.html">A200 Maximum Strength Topical</a>
<a class="doc-type doc-type-cons" href="/cons/a200-time-tested-formula-topical.html">A200 Time-Tested Formula Topical</a>
<a class="doc-type doc-type-mtm" href="/mtm/abacavir.html">abacavir</a>
<a class="doc-type doc-type-cons" href="/cons/abacavir.html">abacavir</a>
<a class="doc-type doc-type-cdi" href="/cdi/abacavir-solution.html">abacavir solution</a>
<a class="doc-type doc-type-cdi" href="/cdi/abacavir.html">abacavir</a>
<a class="doc-type doc-type-ppa" href="/ppa/abacavir.html">Abacavir</a>
<a class="doc-type doc-type-mtm" href="/mtm/abacavir-and-lamivudine.html">abacavir and lamivudine</a>
<a class="doc-type doc-type-cons" href="/cons/abacavir-and-lamivudine.html">abacavir and lamivudine</a>
<a class="doc-type doc-type-ppa" href="/ppa/abacavir-and-lamivudine.html">Abacavir and Lamivudine</a>
<a class="doc-type doc-type-pro" href="/pro/abacavir-and-lamivudine-tablets.html">Abacavir and Lamivudine Tablets</a>
<a class="doc-type doc-type-monograph" href="/monograph/abacavir-sulfate.html">Abacavir Sulfate</a>
<a class="doc-type doc-type-pro" href="/pro/abacavir-sulfate-tablets.html">Abacavir Sulfate Tablets</a>
<a class="doc-type doc-type-mtm" href="/mtm/abacavir-dolutegravir-and-lamivudine.html">abacavir, dolutegravir, and lamivudine</a>
<a class="doc-type doc-type-cons" href="/cons/abacavir-dolutegravir-and-lamivudine.html">abacavir, dolutegravir, and lamivudine</a>
<a class="doc-type doc-type-cdi" href="/cdi/abacavir-dolutegravir-and-lamivudine.html">abacavir, dolutegravir, and lamivudine</a>
<a class="doc-type doc-type-ppa" href="/ppa/abacavir-dolutegravir-and-lamivudine.html">Abacavir, Dolutegravir, and Lamivudine</a>
<a class="doc-type doc-type-pro" href="/pro/abacavir-lamivudine-and-zidovudinetablets.html">Abacavir, Lamivudine and ZidovudineTablets</a>
<a class="doc-type doc-type-mtm" href="/mtm/abacavir-lamivudine-and-zidovudine.html">abacavir, lamivudine, and zidovudine</a>
<a class="doc-type doc-type-cons" href="/cons/abacavir-lamivudine-and-zidovudine.html">abacavir, lamivudine, and zidovudine</a>

但我希望文本indside a陣列中的哪個是

A & D topical
A + D Cracked Skin Relief cream
A-200 Lice Treatment....
.............

您可以使用一種方法轉換為列表。

public List<String> contents(Elements elements) {
    List<String> list=new ArrayList<>();
    for (Element element : elements) {
        list.add(element.text());
    }
    return list;
}

您需要遍歷所有元素並獲取內部HTML:

final Elements e = document.select(".doc-type-list li a");
for (final Element elem : e)
{
    System.out.println(elem.html());
}

獲取元素自己的文本

for (Element element : e) { System.out.println(element.ownText()); }

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM