简体   繁体   English

Jsoup在网站上看不到一些文本

[英]Jsoup not seeing some text on website

Currently I am making a program (in Java) that grabs all the streamers on twitch (Videogame streaming site) from a given URL eg and lists them into a text file using Jsoup. 目前,我正在编写一个程序(用Java),该程序从给定的URL中抓取twitch(视频游戏流媒体站点)上的所有流光, 例如,并使用Jsoup将它们列出到文本文件中。

However, no matter what I try, it seems like I can't get the streamer's names. 但是,无论我尝试什么,似乎都无法获得拖缆的名称。 After a while I discovered that the page source for some reason does not contain the streamer's names which I think could be the problem? 一段时间后,我发现页面源出于某种原因不包含流媒体的名称,我认为这可能是问题所在?

Here is my code currently. 这是我目前的代码。

public static void main(String[] args) throws IOException {
    int i = 0;

    PrintWriter streamerwriter = new PrintWriter("streamer.txt", "UTF-8");
    Document doc = Jsoup.connect(https://www.twitch.tv/directory/game/Hearthstone%3A%20Heroes%20of%20Warcraft).get();
    Elements streamers = doc.getElementsByClass("js-profile-link");

    for (Element streamer : streamers) {
        i++;
        System.out.println(i + "." + streamer.text());
        streamerwriter.println(i + "." + streamer.text());
    }

    streamerwriter.close();
}

Any help would be greatly appreciated. 任何帮助将不胜感激。

you don't need to parse webpage.Because twitch has an api to select streamers. 您不需要解析网页。因为twitch具有用于选择彩带的api。

https://streams.twitch.tv/kraken/streams?limit=20&offset=0&game=Hearthstone%3A+Heroes+of+Warcraft&broadcaster_language=&on_site=1 https://streams.twitch.tv/kraken/streams?limit=20&offset=0&game=Hearthstone%3A+Heroes+of+Warcraft&broadcaster_language=&on_site=1

so you should parse json data 所以你应该解析json数据

if you want to know why you don't see streamers in jsoup because of lazy loading.because that part you want to parse is loaded lazily.You should know that lazy request and parse that url by jsoup which i found and wrote up.(twitch api) 如果您想知道为什么由于延迟加载而在jsoup中看不到彩带,因为您要解析的部分是延迟加载的。您应该知道该延迟请求并使用我找到并编写的jsoup解析该URL。(抽搐API)

Please check this question out: how to use Jsoup in site that has lazyload scrollLoader.js 请检查以下问题: 如何在具有lazyload scrollLoader.js的网站中使用Jsoup

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM