How do I parse the value of Content-Type from an HTTP header response?

Question

My application makes numerous HTTP requests. How can I parse Content-Type header values without writing a regular expression? For example:

text/html; charset=UTF-8

For context, here is the code I use to fetch resources from the internet:

from requests import head

foo = head("http://www.example.com")

The output I am expecting is similar to what the methods in mimetools provide. For example:

x = magic("text/html; charset=UTF-8")

I would then expect:

x.getparam('charset')  # UTF-8
x.getmaintype()  # text
x.getsubtype()  # html

Answer 1

Unfortunately, requests doesn't give you an interface for parsing the content type, and the standard library is a bit of a mess in this area. So I see two options:

Option 1: Use the third-party python-mimeparse library.

Option 2: To separate the MIME type from options such as charset, you can use the same technique that requests uses internally to parse the type and encoding: cgi.parse_header.

import cgi
import requests
response = requests.head('http://example.com')
mimetype, options = cgi.parse_header(response.headers['Content-Type'])

The rest should be simple enough to handle with a split:

maintype, subtype = mimetype.split('/')
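As a standard-library alternative that this answer does not mention, email.message.Message can parse the same header and exposes accessors close to the mimetools-style methods asked for in the question. This matters on newer interpreters, since the cgi module was deprecated in Python 3.11 and removed in Python 3.13. The following is only a minimal sketch; parse_content_type is a hypothetical helper name, not part of any library.

```python
from email.message import Message


def parse_content_type(value):
    """Wrap a raw Content-Type value in an email.message.Message.

    parse_content_type is a hypothetical helper name used for illustration.
    """
    msg = Message()
    msg["Content-Type"] = value
    return msg


x = parse_content_type("text/html; charset=UTF-8")
maintype = x.get_content_maintype()  # "text"
subtype = x.get_content_subtype()    # "html"
charset = x.get_param("charset")     # "UTF-8"
```

With a requests response, the value passed in would be response.headers['Content-Type']. get_param returns None when the parameter is absent, so a missing charset does not raise.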

Answer 2

Your question is a bit unclear. I assume you are using some sort of web application framework such as Django or Flask?

Here is an example of how to read Content-Type using Flask:

from flask import Flask, request

app = Flask(__name__)


@app.route("/")
def test():
    # Return the incoming request's Content-Type header (empty string if absent)
    return request.headers.get('Content-Type', '')


if __name__ == "__main__":
    app.run()

Answer 3

Your response (foo) will have a dictionary containing the headers. Try something like:

foo.headers.get('content-type')

Or print foo.headers to see all the headers.

3 answers

Answer 1
17 2016-04-14 15:34:16

Answer 2
0 2015-09-01 11:27:21

Answer 3
-1 2015-09-01 11:27:06
