Python 3 requests.get（）。text返回未编码的字符串

Question

Python 3 requests.get().text returns unencoded string. Python 3 requests.get（）。text返回未编码的字符串。 If I execute: 如果我执行：

import requests
request = requests.get('https://google.com/search?q=Кто является президентом России?').text.lower()
print(request)

I get kind of this: 我得到这样的：

&#1050;&#1090;&#1086; &#1103;&#1074;&#1083;&#1103;&#1077;&#1090;&#1089;&#1103; &#1087;&#1088;&#1077;&#1079;&#1080;&#1076;

I've tried to change google.com to google.ru 我试图将google.com更改为google.ru

If I execute: 如果我执行：

import requests
request = requests.get('https://google.ru/search?q=Кто является президентом России?').text.lower()
print(request)

I get kind of this: 我得到这样的：

d0%9a%d1%82%d0%be+%d1%8f%d0%b2%d0%bb%d1%8f%d0%b5%d1%82%d1%81%d1%8f+%d0%bf%d1%80%d0%b5%d0%b7%d0%b8%d0%b4%d0%b5%d0%bd%d1%82%d0%be%d0%bc+%d0%a0%d0%be%d1%81%d1%81%d0%b8%d0

I need to get an encoded normal string. 我需要获取一个编码的普通字符串。

Answer 1

You were getting this error because requests was not able to identify the correct encoding of the response. 您收到此错误是因为请求无法识别响应的正确编码。 So if you are sure about the response encoding then you can set it like the following: 因此，如果您对响应编码有把握，则可以像下面这样设置：

response = requests.get(url) response.encoding --> to check the encoding response.encoding = "utf-8" --> or any other encoding.

And then get the content with .text method. 然后使用.text方法获取内容。

Answer 2

I fixed it with urllib.parse.unquote() method: 我用urllib.parse.unquote()方法修复了它：

import requests
from urllib.parse import unquote

request = unquote(requests.get('https://google.ru/search?q=Кто является президентом России?').text.lower())
print(request)

Python 3 requests.get（）。text返回未编码的字符串

问题描述

2 个解决方案

解决方案1
1 2018-08-21 12:23:35

解决方案2
0 2018-08-21 11:24:48

Python 3 requests.get（）。text返回未编码的字符串

问题描述

2 个解决方案

解决方案1 1 2018-08-21 12:23:35

解决方案2 0 2018-08-21 11:24:48

解决方案1
1 2018-08-21 12:23:35

解决方案2
0 2018-08-21 11:24:48