简体   繁体   English

Java - 网络爬虫问题

[英]Java - Web Scraper Issue

I'm trying to scrape this page https://www.teamrankings.com/nba/team/cleveland-cavaliers but I need the grid that displays when you click 'Betting View'... I'm having some trouble.我正在尝试抓取此页面https://www.teamrankings.com/nba/team/cleveland-cavaliers但我需要在您单击“投注视图”时显示的网格...我遇到了一些麻烦。 Is it even possible with the code I have?甚至可以使用我拥有的代码吗? I'm getting null when I print div.当我打印 div 时,我得到了空值。

    String teamLink = "https://www.teamrankings.com/nba/team/cleveland-cavaliers/.betting"; // + Teams[i] + "/";

    Document doc = Jsoup.connect(teamLink).get();

    final WebClient webClient = new WebClient(BrowserVersion.FIREFOX_45);
    HtmlPage page = webClient.getPage(teamLink);
    final HtmlDivision div = page.getFirstByXPath("div.btn-group");

    System.out.println(div);
    page = div.click();

There's no need to click the button.无需单击按钮。 The data from both tabs seems to be in both tabs.来自两个选项卡的数据似乎都在两个选项卡中。 The button appears to run a JavaScript function that shows and hides columns.该按钮似乎运行了一个显示和隐藏列的 JavaScript 函数。 Just fetch the whole table with ID DataTables_Table_0 and trash the columns you don't care about.只需使用 ID DataTables_Table_0获取整个表并删除您不关心的列。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM