如何在Jsoup中解析此特定数据

Question

即时通讯不知道如何解析jsoup中的“ bundesweit”和日期字符串，因为它们都具有相同的类名（col2）

<strong><a href="/stellenangebote/109499-python-entwickler-gui-testing?page=1&amp;query%5Bcity%5D=&amp;query%5Bradius%5D=100&amp;query%5Btext%5D=Python" title="Python-Entwickler (m/w) GUI Testing">Python-Entwickler (m/w) GUI Testing</a></strong>
<br>
<a class="job-offer-teaser-company" href="/unternehmen/ruecker-gmbh" title="Rücker">Rücker</a>
</div>
<div class='col2'>
bundesweit
</div>
<div class='col2'>
08.12.2013
</div>

我尝试了这个：

Elements jobTitleElement = element.select("a");
                        Elements companyNameElement = element.select(".job-offer-teaser-company");
                        Elements locationElement = element.select(".cal2");

非常感谢你

Answer 1

如果HTML遵循相同的结构，则只需选择它们两者，然后使用索引将它们拆分。

//Get the HTML
Document doc = Jsoup.parse(html); 
//or
Document doc = Jsoup.connect(url).get();

//Select the elements
Elements col2Elements = doc.select("div.col2"); //This will return a collection of Element objects
String firstElement = col2Elements.get(0).text(); //Get the first
String secondElement = col2Elements.get(1).text(); //Get the second

Answer 2

您可以使用以下代码：

Document doc = Jsoup.parse(html);
Elements elements = doc.getElementsByClass("col2");
String bundesweitContent = elements.get(0).text();
System.out.println(bundesweitContent); // You get "bundesweit"

参考：

http://jsoup.org/cookbook/introduction/parsing-a-document

http://jsoup.org/apidocs/

如何在Jsoup中解析此特定数据

问题描述

2 个解决方案

解决方案1
1 已采纳 2014-08-04 10:18:58

解决方案2
1 2014-08-04 10:19:42

如何在Jsoup中解析此特定数据

问题描述

2 个解决方案

解决方案1 1 已采纳 2014-08-04 10:18:58

解决方案2 1 2014-08-04 10:19:42

解决方案1
1 已采纳 2014-08-04 10:18:58

解决方案2
1 2014-08-04 10:19:42