简体   繁体   English

如何使用JSoup从iFrame标记中的多个表中提取数据

[英]how to extract data from multiple table in iFrame tag using JSoup

I need to extract content of multible table in iframe tag, i'm using this code 我需要在iframe广告代码中提取multible表的内容,我正在使用此代码

Document doc = Jsoup.connect("http://pubedt.uvsq.fr/VER/g4227.html")
            .header("User-Agent", "Mozilla/5.0 (X11; Linux x86_64) Gecko/20100101 Firefox/19.0").timeout(100*1000)
            .get();
    for (Element table : doc.select("table")) {
    //    System.out.println(table.html());
        for (Element row : table.select("tr")) {
            Elements tds = row.select("td");
            if (tds.size() > 6) {
                System.out.println(tds.get(0).text() + ":" + tds.get(1).text());
            }
        }
    }

looks like you are on the right track, but tds.size() never exceeds 6 entries: 看起来您处在正确的轨道上,但tds.size()永远不会超过6个条目:

Document doc = Jsoup
        .connect("http://pubedt.uvsq.fr/VER/g4227.html")
        .header("User-Agent",
                "Mozilla/5.0 (X11; Linux x86_64) Gecko/20100101 Firefox/19.0")
        .timeout(100 * 1000).get();
for (Element table : doc.select("table")) {
    // System.out.println(table.html());
    for (Element row : table.select("tr")) {
        Elements tds = row.select("td");
        // log added here
        System.out.println("tds.size(): " + tds.size());
        if (tds.size() > 6) {
            System.out.println(tds.get(0).text() + ":"
                    + tds.get(1).text());
        }
    }
}

output: 输出:

tds.size(): 6
tds.size(): 1
tds.size(): 1
tds.size(): 1
tds.size(): 0
tds.size(): 1
tds.size(): 1
tds.size(): 1
tds.size(): 1
tds.size(): 1
tds.size(): 0
tds.size(): 1
tds.size(): 1

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM