C# Convert string from UTF-8 to ISO-8859-1 (Latin1) H

Question

I have googled this topic and looked at every answer, but I still don't get it.

Basically, I need to convert a UTF-8 string to ISO-8859-1, and I do it using the following code:

Encoding iso = Encoding.GetEncoding("ISO-8859-1");
Encoding utf8 = Encoding.UTF8;
string msg = iso.GetString(utf8.GetBytes(Message));

My source string is

Message = "ÄäÖöÕõÜü"

But unfortunately my result string becomes

msg = "Ã?Ã¤Ã?Ã¶Ã?ÃµÃ?Ã¼"

What am I doing wrong here?

Answer 1

Use Encoding.Convert to adjust the byte array before attempting to decode it into your destination encoding.

Encoding iso = Encoding.GetEncoding("ISO-8859-1");
Encoding utf8 = Encoding.UTF8;
byte[] utfBytes = utf8.GetBytes(Message);
byte[] isoBytes = Encoding.Convert(utf8, iso, utfBytes);
string msg = iso.GetString(isoBytes);

Answer 2

I think your problem is that you assume the bytes that represent the UTF-8 string will produce the same string when interpreted as something else (ISO-8859-1). That is simply not the case. I recommend that you read this excellent article by Joel Spolsky.
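To make that point concrete, here is a minimal sketch that prints the bytes one character produces under each encoding, and what happens when UTF-8 bytes are read back as ISO-8859-1. The class and method names (`EncodingBytesDemo`, `Hex`, `Misdecode`) are hypothetical, chosen only for this illustration; the character is written as a `\u` escape so the result does not depend on how the source file is saved.

```csharp
using System;
using System.Text;

public static class EncodingBytesDemo
{
    // Returns the bytes of `text` in the given encoding as hex, e.g. "C3-A4".
    public static string Hex(string text, Encoding encoding)
    {
        return BitConverter.ToString(encoding.GetBytes(text));
    }

    // Encodes with one encoding and decodes with another,
    // which is the mistake made in the question.
    public static string Misdecode(string text, Encoding encodeWith, Encoding decodeWith)
    {
        return decodeWith.GetString(encodeWith.GetBytes(text));
    }
}
```

With `latin1 = Encoding.GetEncoding("ISO-8859-1")`, `Hex("\u00E4", Encoding.UTF8)` gives `C3-A4` while `Hex("\u00E4", latin1)` gives `E4`. Reading the two UTF-8 bytes as ISO-8859-1 with `Misdecode("\u00E4", Encoding.UTF8, latin1)` yields the two characters `Ã¤`, which is the pattern visible in the question's result.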

Answer 3

Try this:

Encoding iso = Encoding.GetEncoding("ISO-8859-1");
Encoding utf8 = Encoding.UTF8;
byte[] utfBytes = utf8.GetBytes(Message);
byte[] isoBytes = Encoding.Convert(utf8,iso,utfBytes);
string msg = iso.GetString(isoBytes);

Answer 4

You need to fix the source of the string in the first place.

A string in .NET is just a sequence of 16-bit UTF-16 code units (chars), so a string itself isn't in any particular byte encoding such as UTF-8 or ISO-8859-1.

It's when you take that string and convert it to a set of bytes that encoding comes into play.

In any case, what you did, encoding a string to a byte array with one character set and then decoding it with another, will not work, as you have seen.
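As a rough illustration of encoding only mattering at the byte boundary, the sketch below turns a string into ISO-8859-1 bytes and reports failure when the string contains a character that ISO-8859-1 cannot represent. The names `Latin1Strict` and `TryEncode` are hypothetical; the only assumption about the framework is the `Encoding.GetEncoding(string, EncoderFallback, DecoderFallback)` overload, which is used here to get an exception instead of a silent `?` substitution.

```csharp
using System;
using System.Text;

public static class Latin1Strict
{
    // ISO-8859-1 that throws on unrepresentable characters
    // instead of silently replacing them.
    private static readonly Encoding StrictLatin1 = Encoding.GetEncoding(
        "ISO-8859-1",
        EncoderFallback.ExceptionFallback,
        DecoderFallback.ExceptionFallback);

    // Returns true and the ISO-8859-1 bytes when every character fits;
    // returns false and an empty array otherwise.
    public static bool TryEncode(string text, out byte[] bytes)
    {
        try
        {
            bytes = StrictLatin1.GetBytes(text);
            return true;
        }
        catch (EncoderFallbackException)
        {
            bytes = new byte[0];
            return false;
        }
    }
}
```

For example, `TryEncode("\u00C4\u00E4", out var b)` succeeds with the two bytes `C4 E4`, while `TryEncode("\u20AC", out var c)` (the euro sign) returns false because that character has no ISO-8859-1 code.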

Can you tell us more about where that original string comes from, and why you think it has been encoded wrong?

Answer 5

That code seems a bit strange. To get a string from a UTF-8 byte stream, all you need to do is:

string str = Encoding.UTF8.GetString(utf8ByteArray);

If you need to save an ISO-8859-1 byte stream somewhere, add one more line after the previous one:

byte[] iso88591data = Encoding.GetEncoding("ISO-8859-1").GetBytes(str);
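Putting the two lines together, a minimal sketch of the whole UTF-8 bytes to ISO-8859-1 bytes path might look like the following. The class and method names (`Utf8ToLatin1`, `ViaString`, `ViaConvert`) are hypothetical; `ViaConvert` is included only to show that `Encoding.Convert` performs the same decode-then-encode step in a single call.

```csharp
using System;
using System.Text;

public static class Utf8ToLatin1
{
    private static readonly Encoding Latin1 = Encoding.GetEncoding("ISO-8859-1");

    // Decode the UTF-8 bytes into a string, then encode that string as ISO-8859-1.
    public static byte[] ViaString(byte[] utf8Bytes)
    {
        string text = Encoding.UTF8.GetString(utf8Bytes);
        return Latin1.GetBytes(text);
    }

    // The same conversion in one call.
    public static byte[] ViaConvert(byte[] utf8Bytes)
    {
        return Encoding.Convert(Encoding.UTF8, Latin1, utf8Bytes);
    }
}
```

For the question's sample text, both methods turn the 16 UTF-8 bytes into the 8 ISO-8859-1 bytes `C4 E4 D6 F6 D5 F5 DC FC`. Those bytes are the actual result of the conversion; decoding them back into a .NET string just reproduces the original text.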

Answer 6

I just used Nathan's solution and it works fine. I needed to convert ISO-8859-1 to Unicode:

string isocontent = Encoding.GetEncoding("ISO-8859-1").GetString(fileContent, 0, fileContent.Length);
byte[] isobytes = Encoding.GetEncoding("ISO-8859-1").GetBytes(isocontent);
byte[] ubytes = Encoding.Convert(Encoding.GetEncoding("ISO-8859-1"), Encoding.Unicode, isobytes);
return Encoding.Unicode.GetString(ubytes, 0, ubytes.Length);

Answer 7

Encoding targetEncoding = Encoding.GetEncoding(1252);
// Encode a string into an array of bytes.
Byte[] encodedBytes = targetEncoding.GetBytes(utfString);
// Show the encoded byte values.
Console.WriteLine("Encoded bytes: " + BitConverter.ToString(encodedBytes));
// Decode the byte array back to a string with the same encoding that produced it.
String decodedString = targetEncoding.GetString(encodedBytes);

Answer 8

Maybe this can help. It converts text from one code page to another:

    public static string fnStringConverterCodepage(string sText, string sCodepageIn = "ISO-8859-8", string sCodepageOut="ISO-8859-8")
    {
        string sResultado = string.Empty;
        try
        {
            byte[] tempBytes;
            tempBytes = System.Text.Encoding.GetEncoding(sCodepageIn).GetBytes(sText);
            sResultado = System.Text.Encoding.GetEncoding(sCodepageOut).GetString(tempBytes);
        }
        catch (Exception)
        {
            sResultado = "";
        }
        return sResultado;
    }

Usage:

string sMsg = "ERRO: NÃ£o foi possivel acessar o servico de AutenticaÃ§Ã£o";
var sOut = fnStringConverterCodepage(sMsg, "ISO-8859-1", "UTF-8");

Output:

"ERRO: Não foi possivel acessar o servico de Autenticação"

Answer 9

Here is a sample for ISO-8859-9:

// Requires: using System.IO; using System.Net.Mail; using System.Text; using System.Web.UI;
protected void btnKaydet_Click(object sender, EventArgs e)
{
    // Configure the HTTP response to use ISO-8859-9.
    Response.Clear();
    Response.Buffer = true;
    Response.ContentType = "application/vnd.openxmlformats-officedocument.wordprocessingml.document";
    Response.AddHeader("Content-Disposition", "attachment; filename=XXXX.doc");
    Response.ContentEncoding = Encoding.GetEncoding("ISO-8859-9");
    Response.Charset = "ISO-8859-9";
    EnableViewState = false;

    // Render the form to an HTML string.
    StringWriter writer = new StringWriter();
    HtmlTextWriter html = new HtmlTextWriter(writer);
    form1.RenderControl(html);

    // Encode the rendered HTML as ISO-8859-9 bytes for the attachment.
    byte[] bytesInStream = Encoding.GetEncoding("iso-8859-9").GetBytes(writer.ToString());
    MemoryStream memoryStream = new MemoryStream(bytesInStream);

    // Send the encoded document as a mail attachment.
    string msgBody = "";
    string Email = "mail@xxxxxx.org";
    SmtpClient client = new SmtpClient("mail.xxxxx.org");
    MailMessage message = new MailMessage(Email, "mail@someone.com", "ONLINE APP FORM WITH WORD DOC", msgBody);
    Attachment att = new Attachment(memoryStream, "XXXX.doc", "application/vnd.openxmlformats-officedocument.wordprocessingml.document");
    message.Attachments.Add(att);
    message.BodyEncoding = System.Text.Encoding.UTF8;
    message.IsBodyHtml = true;
    client.Send(message);
}

9 answers (score, date posted)

Answer 1: 186, accepted, 2009-12-17 14:47:39
Answer 2: 27, 2009-12-17 14:45:50
Answer 3: 16, 2009-12-17 14:47:23
Answer 4: 8, 2009-12-17 14:44:54
Answer 5: 7, 2014-06-13 08:54:56
Answer 6: 0, 2014-06-27 13:55:35
Answer 7: 0, 2014-10-26 13:55:15
Answer 8: 0, 2020-12-18 22:49:23
Answer 9: -5, 2015-09-17 08:02:40
