Getting the response charset of a GET or POST request

Question

I am trying to extract the response charset in a Java web app that uses Apache HTTP Client.

For example, one possible value obtained from the "Content-Type" header is

    text/html; charset=UTF-8

My code then extracts all the text after the "=" sign...

So the extracted charset will be

    UTF-8

I just want to know: is the above method for obtaining the response charset correct? Or is there some scenario where it will not work? Is there something I am missing here?

Answer 1

Doesn't httpclient (or http core) already provide that functionality? Something like this:

    HttpResponse response = ...
    // Returns the charset name from the entity's Content-Type, or null if none is declared
    String charset = EntityUtils.getContentCharSet(response.getEntity());

Answer 2

The method suggested by forty-two works, but it is deprecated. This website has a good example of how to find the charset:

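    HttpEntity entity = response.getEntity();
    // Falls back to ContentType.DEFAULT_TEXT when the entity has no Content-Type
    ContentType contentType = ContentType.getOrDefault(entity);
    // getCharset() returns null when no charset parameter is present
    Charset charset = contentType.getCharset();
    System.out.println("Charset = " + charset);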

Answer 3

Well, that approach will fail when:

the charset value is quoted
the quoted value uses escapes
there are parameters other than "charset"
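
To make those three cases concrete, here is a rough sketch in plain Java that compares the "everything after =" approach with a small hand-written parameter scanner. The class `CharsetParam` and its methods `naive` and `parse` are hypothetical names made up for this illustration, not part of Apache HttpClient, and the scanner only covers the cases listed above rather than the full header grammar. In a real application the library's own parsing is probably the better choice.

```java
// Hypothetical helper for illustration only; not an Apache HttpClient API.
class CharsetParam {

    // The approach from the question: take everything after the first '='.
    static String naive(String header) {
        int eq = header.indexOf('=');
        return eq < 0 ? null : header.substring(eq + 1);
    }

    // Scans the ';'-separated parameters, handling quoted values,
    // backslash escapes inside quotes, and parameters other than charset.
    static String parse(String header) {
        int n = header.length();
        int i = header.indexOf(';');            // parameters start after the media type
        while (i >= 0 && i < n) {
            i++;                                // step past ';'
            int start = i;
            while (i < n && header.charAt(i) != '=' && header.charAt(i) != ';') {
                i++;
            }
            String name = header.substring(start, i).trim();
            if (i >= n || header.charAt(i) == ';') {
                continue;                       // parameter without a value
            }
            i++;                                // step past '='
            StringBuilder value = new StringBuilder();
            if (i < n && header.charAt(i) == '"') {
                i++;                            // step past the opening quote
                while (i < n && header.charAt(i) != '"') {
                    if (header.charAt(i) == '\\' && i + 1 < n) {
                        i++;                    // drop the backslash, keep the escaped char
                    }
                    value.append(header.charAt(i++));
                }
                while (i < n && header.charAt(i) != ';') {
                    i++;                        // skip the closing quote up to the next ';'
                }
            } else {
                while (i < n && header.charAt(i) != ';') {
                    value.append(header.charAt(i++));
                }
            }
            if (name.equalsIgnoreCase("charset")) {
                return value.toString().trim();
            }
        }
        return null;                            // no charset parameter found
    }

    public static void main(String[] args) {
        String[] samples = {
            "text/html; charset=UTF-8",
            "text/html; charset=\"UTF-8\"",
            "text/html; charset=\"UTF\\-8\"",
            "multipart/form-data; boundary=abc; charset=UTF-8",
            "text/html; charset=UTF-8; format=flowed"
        };
        for (String s : samples) {
            System.out.println(s + " | naive: " + naive(s) + " | parse: " + parse(s));
        }
    }
}
```

For the first sample both methods agree. For the others, `naive` returns the surrounding quotes, the backslash, or the text of neighbouring parameters along with the charset name, which is exactly what the list above warns about.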

Answer 1
8 votes, accepted, 2012-02-02 12:55:27

Answer 2
7 votes, 2015-12-15 05:47:35

Answer 3
3 votes, 2012-02-02 12:43:48