如何将 web 抓取的图片保存到文件夹中？（Python）

Question

我如何才能将我从 web 抓取中获得的每张图像存储到一个文件夹中？ 我目前使用 Google Colab，因为我只是在练习。 我想将它们存储在我的 Google Drive 文件夹中。

这是我的 web 抓取代码：

import requests 
from bs4 import BeautifulSoup 

def getdata(url):
  r = requests.get(url)
  return r.text

htmldata = getdata('https://www.yahoo.com/')
soup = BeautifulSoup(htmldata, 'html.parser')

imgdata = []
for i in soup.find_all('img'):
  imgdata = i['src']
  print(imgdata)

Answer 1

我在脚本运行的文件夹中手动创建了一个pics文件夹，用于将图片存储在其中。 比起我在 for 循环中更改了您的代码，以便将其附加 url 到imgdata列表。 try except块在那里是因为并非列表中的每个 url 都是有效的。

import requests 
from bs4 import BeautifulSoup 

def getdata(url):
    r = requests.get(url)
    return r.text

htmldata = getdata('https://www.yahoo.com/')
soup = BeautifulSoup(htmldata, 'html.parser')

imgdata = []
for i in soup.find_all('img'):
    imgdata.append(i['src']) # made a change here so its appendig to the list
    


filename = "pics/picture{}.jpg"
for i in range(len(imgdata)):
    print(f"img {i+1} / {len(imgdata)+1}")
    # try block because not everything in the imgdata list is a valid url
    try:
        r = requests.get(imgdata[i], stream=True)
        with open(filename.format(i), "wb") as f:
            f.write(r.content)
    except:
        print("Url is not an valid")

Answer 2

foo.write('whatever')
foo.close()

如何将 web 抓取的图片保存到文件夹中？（Python）

问题描述

2 个解决方案

解决方案1
1 已采纳 2022-05-20 17:24:22

解决方案2
0

如何将 web 抓取的图片保存到文件夹中？ （Python）

问题描述

2 个解决方案

解决方案1 1 已采纳 2022-05-20 17:24:22

解决方案2 0

如何将 web 抓取的图片保存到文件夹中？（Python）

解决方案1
1 已采纳 2022-05-20 17:24:22

解决方案2
0