简体   繁体   English

Will Jaunt web scraper能否抓取这个javascript网站

[英]Will Jaunt web scraper be capable of scraping this javascript site

I have never done web scraping before, actually just 3 hours ago I google the word web scraping to see what it means... so this is my level of competence on the subject, but I have a task to scrape some numbers for different football matches of this website "betstars.uk" and from what I see it is a javascript website (is it ?) which makes the already hard task for me even harder, so can Jaunt tool for JAVA do this job or I need something else? 我以前从未做过网页抓取,实际上只是3个小时前我在谷歌上搜索网页抓取来看看它意味着什么...所以这是我对这个主题的能力水平,但我有一个任务是为不同的足球刮掉一些数字这个网站的匹配“betstars.uk”,从我看到它是一个javascript网站(是吗?),这使我已经很难的任务更难,所以JAVA的Jaunt工具可以完成这项工作还是我需要别的东西? I am asking because to avoid spending more than an hour learning how to use it just to find out it can't do the job 我问,因为要避免花费超过一个小时学习如何使用它只是为了找出它无法完成这项工作

For some reason I cannot load the website so I can't tell you if it uses javascript to load content or not. 出于某种原因,我无法加载网站,所以我不能告诉你它是否使用javascript加载内容。 It's impossible to scrape a javascript based website with Jaunt because it's a basic web scraper library and it doesnt load javascript at all. 使用Jaunt抓取基于javascript的网站是不可能的,因为它是一个基本的网络刮刀库,它根本不加载javascript。 Though, in case the site uses javascript indeed, you could use htmlUnit to load javascript content and scrape the information you need. 但是,如果网站确实使用了javascript,您可以使用htmlUnit加载javascript内容并抓取您需要的信息。 Here is an easy tutorial on How to Scrape Javascript in Java 这是一个关于如何在Java中使用Javascript的简单教程

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM