Python 从幻灯片中抓取图像

Question

我试图从我学校网站的主页上抓取图片。 但问题是这些图像出现在幻灯片中。 幻灯片中总共有 9 张图片，我想把它们全部刮掉。 我不知道该怎么做，因为我对requests所能做的就是抓取它的第一张图片。 我不想使用selenium因为它很慢。 如何仅使用requests和BeautifulSoup从幻灯片中抓取所有图像？ 任何帮助，将不胜感激。 谢谢！

PS：我知道我必须提供我到目前为止编写的代码，但问题是我还没有尝试过任何东西，因为我真的不知道该怎么做，因为这是第一次我正在从幻灯片中抓取图像，所以请原谅我。 另外，我已经看过这个问题的答案，但我不知道如何在我的程序中实现答案。

Answer 1

要获取滑块中的 9 个图像，您可以使用以下示例：

import requests
from bs4 import BeautifulSoup


url = 'http://www.buddingmindsinternationalschool.com/'
soup = BeautifulSoup(requests.get(url).content, 'html.parser')

for img in soup.select('#rev_slider_2_1 img'):
    print(img['src'])

印刷：

http://www.buddingmindsinternationalschool.com/wp-content/uploads/2017/11/banner-4.jpg
http://www.buddingmindsinternationalschool.com/wp-content/uploads/2017/11/banner-2.jpg
http://www.buddingmindsinternationalschool.com/wp-content/uploads/2017/11/banner-7.jpg
http://www.buddingmindsinternationalschool.com/wp-content/uploads/2017/10/bannerfour.jpg
http://www.buddingmindsinternationalschool.com/wp-content/uploads/2017/11/bannerseven.jpg
http://www.buddingmindsinternationalschool.com/wp-content/uploads/2017/11/bannerfives-1.jpg
http://www.buddingmindsinternationalschool.com/wp-content/uploads/2017/11/banner-5.jpg
http://www.buddingmindsinternationalschool.com/wp-content/uploads/2017/11/bannereight.jpg
http://www.buddingmindsinternationalschool.com/wp-content/uploads/2017/11/banner-6.jpg

Answer 2

尝试这个：

import requests
from bs4 import BeautifulSoup

url = "http://www.buddingmindsinternationalschool.com/"
soup = BeautifulSoup(requests.get(url).content, "html.parser")

for tag in soup.select(".rev-slidebg"):
    img_name = tag['src'].split('/')[-1]

    with open(img_name, "wb") as f:
        req = requests.get(tag['src'])
        f.write(req.content)

Python 从幻灯片中抓取图像

问题描述

2 个解决方案

解决方案1
1 已采纳 2020-10-12 17:44:43

解决方案2
1 2020-10-12 17:50:16

Python 从幻灯片中抓取图像

问题描述

2 个解决方案

解决方案1 1 已采纳 2020-10-12 17:44:43

解决方案2 1 2020-10-12 17:50:16

解决方案1
1 已采纳 2020-10-12 17:44:43

解决方案2
1 2020-10-12 17:50:16