如何使用 beautifulsoap 和 python scrape 受密码保护的网站？

Question

https://khaiexch.com/ How can i scrape this website. https://khaiexch.com/我怎样才能抓取这个网站。 I am trying something like this but it is giving 403 as response我正在尝试这样的事情，但它给出了 403 作为响应

login_url = 'https://khaiexch.com/login/login_action'
login_data = {
    "email": "***", 
      "password": "****", 
    "compute": "****",
    "submitted": "****"
    }
r = requests.post(login_url, data=login_data)

But the above code is giviing me 403 error, Can any one please guide me how can I scrape this website?但是上面的代码给了我403错误，谁能指导我如何抓取这个网站？

Answer 1

Try requesting with get method and use headers aswell that would be something like this:尝试使用get方法请求并使用标头，这将是这样的：

import requests

login_url = 'https://khaiexch.com/login/login_action'
login_data = {
    "email": "***", 
    "password": "****", 
    "compute": "****",
    "submitted": "****"
    }
headers     = {'User-Agent': 'okhttp/3.12.1'}
r = requests.get(login_url, data=login_data, headers=headers)
result = r.status_code
print (result)

如何使用 beautifulsoap 和 python scrape 受密码保护的网站？

问题描述

1 个解决方案

解决方案1
1 2020-08-05 14:08:52

如何使用 beautifulsoap 和 python __scrape__ 受密码保护的网站？

问题描述

1 个解决方案

解决方案1 1 2020-08-05 14:08:52

如何使用 beautifulsoap 和 python scrape 受密码保护的网站？

解决方案1
1 2020-08-05 14:08:52