[英]Using JSOUP to extract data from HTML tables
所以我目前正在嘗試從( http://games.espn.go.com/ffl/leaders ?)中抓取所有數據並存儲它。 我目前的方法只是翻錄網站,從中提取所有數據。 然而,在看到這是多么低效之后,我開始對 JSOUP 做一些研究。 我設法找到了這篇關於在 espn 中使用 JSOUP 的 stackoverflow 帖子。 使用 JSoup 提取 HTML 表格內容。 我嘗試使用相同的方法,但是我不知道如何從http://games.espn.go.com/ffl/leaders獲取所有表格信息? 任何幫助深表感謝!
Document doc = Jsoup
.connect("http://games.espn.go.com/ffl/leaders")
.userAgent(
"Mozilla/5.0 (Windows NT 6.2; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/32.0.1667.0 Safari/537.36")
.ignoreContentType(true).timeout(0).get();
Elements elements = doc.select("table[class=playerTableTable tableBody]");
Elements rows = elements.get(0).select("tr[class^=pncPlayerRow]");
for (Element row : rows) {
Elements tds = row.select("td");
if(tds.size() != 24) continue;
String mPLAYERTEAMPOS = tds.get(0).text();
String mWK_OPP = tds.get(2).text();
String mWK_STATUSET = tds.get(3).text();
String mPASSING_CA = tds.get(5).text();
String mPASSING_YDS = tds.get(6).text();
String mPASSING_TD = tds.get(7).text();
String mPASSING_INT = tds.get(8).text();
String mRUSHING_RUSH = tds.get(10).text();
String mRUSHING_YDS = tds.get(11).text();
String mRUSHING_TD = tds.get(12).text();
String mRECEIVING_REC = tds.get(14).text();
String mRECEIVING_YDS = tds.get(15).text();
String mRECEIVING_TD = tds.get(16).text();
String mRECEIVING_TAR = tds.get(17).text();
String mMISC_2PC = tds.get(19).text();
String mMISC_FUML = tds.get(20).text();
String mMISC_TD = tds.get(21).text();
String mTOTAL_PTS = tds.get(23).text();
System.out.println("mPLAYERTEAMPOS\t\t\t"+ mPLAYERTEAMPOS);
System.out.println("mWK_OPP \t\t\t"+mWK_OPP);
System.out.println("mWK_STATUSET\t\t\t" + mWK_STATUSET);
System.out.println("mPASSING_CA\t\t\t"+mPASSING_CA);
System.out.println("mPASSING_YDS\t\t\t"+mPASSING_YDS);
System.out.println("mPASSING_TD\t\t\t"+mPASSING_TD);
System.out.println("mPASSING_INT\t\t\t"+mPASSING_INT);
System.out.println("mRUSHING_RUSH\t\t\t"+mRUSHING_RUSH);
System.out.println("mRUSHING_YDS\t\t\t"+mRUSHING_YDS);
System.out.println("mRUSHING_TD\t\t\t"+mRUSHING_TD);
System.out.println("mRECEIVING_REC\t\t\t"+mRECEIVING_REC);
System.out.println("mRECEIVING_YDS\t\t\t"+mRECEIVING_YDS);
System.out.println("mRECEIVING_TD\t\t\t"+mRECEIVING_TD);
System.out.println("mRECEIVING_TAR\t\t\t"+mRECEIVING_TAR);
System.out.println("mMISC_2PC\t\t\t"+mMISC_2PC);
System.out.println("mMISC_FUML\t\t\t"+mMISC_FUML);
System.out.println("mMISC_TD\t\t\t"+mMISC_TD);
System.out.println("mTOTAL_PTS\t\t\t"+mTOTAL_PTS);
}
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.