How do I scrape links and images from a website?
I'm new to JavaScript and would like this to be my first project:
I'm trying to extract the channel names and thumbnails from this site's source code, using JavaScript or PHP, so that I can display all of the channel thumbnails on my own website.
This is what I want to capture: the a href="this-link" and the img src, but for every channel on the page:
<div class="browseThumb">
<a href="/realmanpwns" target="_top"><img src="//thumbnails.vaughnsoft.com/1427753239/fetch/live/realmanpwns.png" class"browseThumb" width="200" height="113" /></a>
<div class="browseTxt" onclick="window.location = '/realmanpwns';">
<a href="/realmanpwns" target="_top" class="browseTxt">realmanpwns</a>
</div>
</div>
Any guidance on how to get started is appreciated.
I used Scrapy on a recent project. I didn't know Python at the time, but I had dabbled in a few other languages and it wasn't difficult to pick up.
The Scrapy project has a ton of resources to learn from.
Here's the direct link: http://scrapy.org
P.S. I am in no way affiliated with them, just a happy user.
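To give a feel for the extraction step before you commit to a framework, here is a minimal sketch that uses only Python's standard library (html.parser and urllib.parse), not Scrapy itself. It assumes every channel sits in a div with class browseThumb, as in the snippet from the question. The names ChannelParser and extract_channels, the base URL https://example.com/, the second channel in the sample, and the idea of deriving the channel name from the href path are all my own placeholders, so adjust them to the real page.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class ChannelParser(HTMLParser):
    """Collect one record per <div class="browseThumb"> block."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.channels = []    # finished records
        self._depth = 0       # <div> nesting depth inside the current block
        self._current = None  # record being assembled, or None outside a block

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "div":
            if self._depth:
                # A nested <div> (such as browseTxt) inside the block.
                self._depth += 1
            elif "browseThumb" in (attrs.get("class") or "").split():
                self._depth = 1
                self._current = {"name": None, "link": None, "thumbnail": None}
        elif self._current is not None:
            # Keep only the first link and the first image of each block.
            if tag == "a" and self._current["link"] is None and attrs.get("href"):
                # Assumption: the channel name is the href path without slashes.
                self._current["name"] = attrs["href"].strip("/")
                self._current["link"] = urljoin(self.base_url, attrs["href"])
            elif tag == "img" and self._current["thumbnail"] is None and attrs.get("src"):
                # urljoin also resolves protocol-relative URLs ("//host/path").
                self._current["thumbnail"] = urljoin(self.base_url, attrs["src"])

    def handle_endtag(self, tag):
        if tag == "div" and self._depth:
            self._depth -= 1
            if self._depth == 0:
                # The outer browseThumb <div> just closed.
                self.channels.append(self._current)
                self._current = None


def extract_channels(html, base_url):
    """Return a list of {"name", "link", "thumbnail"} dicts found in html."""
    parser = ChannelParser(base_url)
    parser.feed(html)
    parser.close()
    return parser.channels


# Sample markup modeled on the question; the second channel is made up.
SAMPLE_HTML = """
<div class="browseThumb">
<a href="/realmanpwns" target="_top"><img src="//thumbnails.vaughnsoft.com/1427753239/fetch/live/realmanpwns.png" class="browseThumb" width="200" height="113" /></a>
<div class="browseTxt" onclick="window.location = '/realmanpwns';">
<a href="/realmanpwns" target="_top" class="browseTxt">realmanpwns</a>
</div>
</div>
<div class="browseThumb">
<a href="/otherchannel" target="_top"><img src="//thumbnails.vaughnsoft.com/1427753239/fetch/live/otherchannel.png" width="200" height="113" /></a>
<div class="browseTxt"><a href="/otherchannel" class="browseTxt">otherchannel</a></div>
</div>
"""

if __name__ == "__main__":
    for channel in extract_channels(SAMPLE_HTML, "https://example.com/"):
        print(channel["name"], channel["link"], channel["thumbnail"])
```

To run this against the live page you would download the HTML first (for example with urllib.request.urlopen) and pass it to extract_channels together with the site's real base URL. Scrapy handles that fetching, retrying, and selecting for you, which is why I'd still reach for it on anything bigger than a single page.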