Python urllib.error.HTTPError：HTTP错误403：禁止

Question

I am new to python and was trying to write a script to download a csv file. 我是python的新手，正在尝试编写脚本来下载csv文件。 I am using python 3.6.1. 我正在使用python 3.6.1。 Here's the code 这是代码

from urllib import request

demo_csv_url = 'http://www.sample-videos.com/csv/Sample-Spreadsheet-100-rows.csv'

def downloadCSV(url):
    response = request.urlopen(url)
    csv = response.read()
    csvStr = str(csv)
    lines = csvStr.split('\\n')
    dest = r'csv.csv'
    fx = open(dest,"w")
    for line in lines:
        fx.write(line + '\n')
    fx.close()


downloadCSV(demo_csv_url)

When I run the script, I get the following error 运行脚本时，出现以下错误

Traceback (most recent call last):
  File "C:\Users\Vivek\Desktop\py tutorials\download_csv.py", line 23, in <module>
    downloadCSV(demo_csv_url)
  File "C:\Users\Vivek\Desktop\py tutorials\download_csv.py", line 12, in downloadCSV
    response = request.urlopen(url)
  File "D:\softwares\installed softwares\python\lib\urllib\request.py", line 223, in urlopen
    return opener.open(url, data, timeout)
  File "D:\softwares\installed softwares\python\lib\urllib\request.py", line 532, in open
    response = meth(req, response)
  File "D:\softwares\installed softwares\python\lib\urllib\request.py", line 642, in http_response
    'http', request, response, code, msg, hdrs)
  File "D:\softwares\installed softwares\python\lib\urllib\request.py", line 570, in error
    return self._call_chain(*args)
  File "D:\softwares\installed softwares\python\lib\urllib\request.py", line 504, in _call_chain
    result = func(*args)
  File "D:\softwares\installed softwares\python\lib\urllib\request.py", line 650, in http_error_default
    raise HTTPError(req.full_url, code, msg, hdrs, fp)
urllib.error.HTTPError: HTTP Error 403: Forbidden

I tried adding more headers like 我尝试添加更多标题

hdr = {'User-Agent': 'Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.11 (KHTML, like Gecko) Chrome/23.0.1271.64 Safari/537.11',
       'Accept': 'text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8',
       'Accept-Charset': 'ISO-8859-1,utf-8;q=0.7,*;q=0.3',
       'Accept-Encoding': 'none',
       'Accept-Language': 'en-US,en;q=0.8',
       'Connection': 'keep-alive'}

and then opening the url as response = request.urlopen(url,hdr) But it throws in more errors. 然后以response = request.urlopen(url,hdr)打开response = request.urlopen(url,hdr)但是会引发更多错误。 Could you please let me know what I am doing wrong here. 您能否让我知道我在这里做错了什么。 Thanks 谢谢

Answer 1

That URL throws a 403 when you visit it directly in the browser, so it seems to be working as intended. 当您直接在浏览器中访问该URL时，它将引发403，因此它似乎可以正常工作。 If you want to catch 403's use a try / except . 如果要捕获403，请使用try / except 。

If the content is protected by an Auth header or Cookie, you'll need to figure out what those are and add those to the request. 如果内容受Auth标头或Cookie的保护，则需要确定它们是什么并将其添加到请求中。

Answer 2

You need to authenticate to access this data, you need to provide "password", "username" of some sort. 您需要进行身份验证才能访问此数据，需要提供某种“密码”，“用户名”。

Python urllib.error.HTTPError：HTTP错误403：禁止

问题描述

2 个解决方案

解决方案1
0 2017-04-03 16:14:43

解决方案2
0 2017-04-03 16:18:55

Python urllib.error.HTTPError：HTTP错误403：禁止

问题描述

2 个解决方案

解决方案1 0 2017-04-03 16:14:43

解决方案2 0 2017-04-03 16:18:55

解决方案1
0 2017-04-03 16:14:43

解决方案2
0 2017-04-03 16:18:55