Java使用jsoup解析html表中的数据

Question

我想从链接的表中获取数据。

链接：

https://www.nasdaq.com/symbol/aapl/financials?query=balance-sheet

我已经尝试过我的代码，但是没有用

public static void main(String[] args) {
    try {
        Document doc = Jsoup.connect("https://www.nasdaq.com/symbol/aapl/financials?query=balance-sheet").get();
        Elements trs = doc.select("td_genTable");



        for (Element tr : trs) {
            Elements tds = tr.getElementsByTag("td");
            Element td = tds.first();
            System.out.println(td.text());
        }
    } catch (IOException e) {
        e.printStackTrace();
    }
}

有谁能够帮助我？ 为了使其正常工作

我没有得到该表的输出。 什么都没发生。

Answer 1

测试您的代码后，我得到了Read time out问题。 在Google上查找时，我发现了这篇文章，建议添加一个用户代理对其进行修复，它对我有用 。 所以，你可以试试看

public static void main(String[] args) {
    try {
        // add user agent
        Document doc = Jsoup.connect("https://www.nasdaq.com/symbol/aapl/financials?query=balance-sheet")
                .userAgent("Mozilla/5.0").get();
        Elements trs = doc.select("tr");
        for (Element tr : trs) {
            Elements tds = tr.select(".td_genTable");
            // avoid tr headers that produces NullPointerException
            if(tds.size() == 0) continue;
            // look for siblings (see the html structure of the web)
            Element td = tds.first().siblingElements().first();
            System.out.println(td.text());
        }
    } catch (IOException e) {
        e.printStackTrace();
    }
}

我添加了用户代理选项，并修复了一些查询错误 。 这将对您开始工作很有用；）

Java使用jsoup解析html表中的数据

问题描述

1 个解决方案

解决方案1
0 已采纳 2019-01-09 18:06:59

Java使用jsoup解析html表中的数据

问题描述

1 个解决方案

解决方案1 0 已采纳 2019-01-09 18:06:59

解决方案1
0 已采纳 2019-01-09 18:06:59