[英]Why is “requests” in Python unable to download this url image but the browser can?
[英]Unable to download url link with requests in Python
目標是從cvit.iiit.ac.in/projects/SceneTextUnderstanding/IIIT5K-Word_V3.0.tar.gz
下載tar.gz
使用wget
可以毫無問題地下載該文件。
!wget cvit.iiit.ac.in/projects/SceneTextUnderstanding/IIIT5K-Word_V3.0.tar.gz --no-check-certificate
但是,使用requests
下載文件
import requests
url='cvit.iiit.ac.in/projects/SceneTextUnderstanding/IIIT5K-Word_V3.0.tar.gz'
r = requests.get(url)
返回錯誤
MissingSchema Traceback (most recent call last)
<ipython-input-11-fa35f2c0ddc0> in <module>()
1 url='cvit.iiit.ac.in/projects/SceneTextUnderstanding/IIIT5K-Word_V3.0.tar.gz'
----> 2 r = requests.get(url)
5 frames
/usr/local/lib/python3.7/dist-packages/requests/models.py in prepare_url(self, url, params)
386 error = error.format(to_native_string(url, 'utf8'))
387
--> 388 raise MissingSchema(error)
389
390 if not host:
MissingSchema: Invalid URL 'cvit.iiit.ac.in/projects/SceneTextUnderstanding/IIIT5K-Word_V3.0.tar.gz': No schema supplied. Perhaps you meant http://cvit.iiit.ac.in/projects/SceneTextUnderstanding/IIIT5K-Word_V3.0.tar.gz?
我可以知道是什么問題嗎?
您的url
變量中缺少 http:// 或 https:// (架構,如錯誤消息所述)。
url = 'https://cvit.iiit.ac.in/projects/SceneTextUnderstanding/IIIT5K-Word_V3.0.tar.gz'
你錯過了 http 標頭
import requests
requests.get("http://cvit.iiit.ac.in/projects/SceneTextUnderstanding/IIIT5K-Word_V3.0.tar.gz")
這也應該有效
import wget
wget.download("http://cvit.iiit.ac.in/projects/SceneTextUnderstanding/IIIT5K-Word_V3.0.tar.gz", out="YOUR_PATH")
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.