简体   繁体   English

如何从网站上抓取链接和图像?

[英]How do I scrape links and images from a website?

I'm new to javascript and would like my first project to be this: 我是javascript新手,希望我的第一个项目是:

im trying to take the name of channels and thumbnails from this site from source code using a javascript or php to display all thumbnails of channels on my website. 我试图使用javascript或php从源代码中获取本网站的频道和缩略图的名称,以显示我网站上所有频道的缩略图。

this is i want to catch the a href="this-link" and the img src but from all channels: 这是我想从所有渠道捕获a href =“ this-link”和img src的方法:

<div class="browseThumb">
                <a href="/realmanpwns" target="_top"><img src="//thumbnails.vaughnsoft.com/1427753239/fetch/live/realmanpwns.png" class"browseThumb" width="200" height="113" /></a>
                <div class="browseTxt" onclick="window.location = '/realmanpwns';">
                    <a href="/realmanpwns" target="_top" class="browseTxt">realmanpwns</a>
                </div>
            </div>

Any guidance for how to get started is appreciated. 任何有关如何入门的指导都值得赞赏。

I used scrapy on a recent project. 我在最近的项目中使用了scrapy。 Didn't know python at the time but I've dabbled with a few other languages and it wasn't difficult to pick up. 当时不了解python,但我涉猎了其他几种语言,因此学习起来并不难。

They have a ton of resources to learn from. 他们有大量的资源可以学习。

Here's the direct link: http://scrapy.org 这是直接链接: http : //scrapy.org

PS. PS。 I am in no way affiliated with them, just a happy user. 我绝不隶属于他们,只是一个快乐的用户。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM