How to convert Turkish chars to English chars in a string?

Question

string strTurkish = "ÜST";

Answer 1

You can use the following method for solving your problem. The other methods do not convert "Turkish Lowercase I (\ı)" correctly.

public static string RemoveDiacritics(string text)
{
    Encoding srcEncoding = Encoding.UTF8;
    Encoding destEncoding = Encoding.GetEncoding(1252); // Latin alphabet

    text = destEncoding.GetString(Encoding.Convert(srcEncoding, destEncoding, srcEncoding.GetBytes(text)));

    string normalizedString = text.Normalize(NormalizationForm.FormD);
    StringBuilder result = new StringBuilder();

    for (int i = 0; i < normalizedString.Length; i++)
    {
        if (!CharUnicodeInfo.GetUnicodeCategory(normalizedString[i]).Equals(UnicodeCategory.NonSpacingMark))
        {
            result.Append(normalizedString[i]);
        }
    }

    return result.ToString();
}

Answer 2

var text = "ÜST";
var unaccentedText  = String.Join("", text.Normalize(NormalizationForm.FormD)
        .Where(c => char.GetUnicodeCategory(c) != UnicodeCategory.NonSpacingMark));

Answer 3

I'm not an expert on this sort of thing, but I think you can use string.Normalize to do it, by decomposing the value and then effectively removing an non-ASCII characters:

using System;
using System.Linq;
using System.Text;

class Test
{
    static void Main()
    {
        string text = "\u00DCST";
        string normalized = text.Normalize(NormalizationForm.FormD);
        string asciiOnly = new string(normalized.Where(c => c < 128).ToArray());
        Console.WriteLine(asciiOnly);
    }    
}

It's entirely possible that this does horrible things in some cases though.

Answer 4

This is not a problem that requires a general solution. It is known that there only 12 special characters in Turkish alphabet that has to be normalized. Those are ı,İ,ö,Ö,ç,Ç,ü,Ü,ğ,Ğ,ş,Ş. You can write 12 rules to replace those with their English counterparts: i,I,o,O,c,C,u,U,g,G,s,S.

Answer 5

public string TurkishCharacterToEnglish(string text)
{
    char[] turkishChars = {'ı', 'ğ', 'İ', 'Ğ', 'ç', 'Ç', 'ş', 'Ş', 'ö', 'Ö', 'ü', 'Ü'};
    char[] englishChars = {'i', 'g', 'I', 'G', 'c', 'C', 's', 'S', 'o', 'O', 'u', 'U'};
    
    // Match chars
    for (int i = 0; i < turkishChars.Length; i++)
        text = text.Replace(turkishChars[i], englishChars[i]);

    return text;
}

Answer 6

Public Function Ceng(ByVal _String As String) As String
    Dim Source As String = "ığüşöçĞÜŞİÖÇ"
    Dim Destination As String = "igusocGUSIOC"
    For i As Integer = 0 To Source.Length - 1
        _String = _String.Replace(Source(i), Destination(i))
    Next
    Return _String
End Function

Answer 7

    public static string TurkishChrToEnglishChr(this string text)
    {
        if (string.IsNullOrEmpty(text)) return text;

        Dictionary<char, char> TurkishChToEnglishChDic = new Dictionary<char, char>()
        {
            {'ç','c'},
            {'Ç','C'},
            {'ğ','g'},
            {'Ğ','G'},
            {'ı','i'},
            {'İ','I'},
            {'ş','s'},
            {'Ş','S'},
            {'ö','o'},
            {'Ö','O'},
            {'ü','u'},
            {'Ü','U'}
        };

        return text.Aggregate(new StringBuilder(), (sb, chr) =>
        {
            if (TurkishChToEnglishChDic.ContainsKey(chr))
                sb.Append(TurkishChToEnglishChDic[chr]);
            else
                sb.Append(chr);

            return sb;
        }).ToString();
    }

Answer 8

Hey go through this link, you'll find the code for it. I didn't create it though, just to make sure.

How to convert Turkish chars to English chars in a string?

Question

7 answers

solution1
20 2012-12-19 13:26:46

solution2
19 ACCPTED 2012-12-01 15:25:11

solution3
7 2012-12-01 15:22:54

solution4
2 2013-04-22 10:24:20

solution5
2 2020-12-14 10:20:36

solution6
1 2014-12-04 20:36:33

solution7
0 2022-01-26 13:43:57

solution8
0 2022-01-26 13:50:07

How to convert Turkish chars to English chars in a string?

Question

7 answers

solution1 20 2012-12-19 13:26:46

solution2 19 ACCPTED 2012-12-01 15:25:11

solution3 7 2012-12-01 15:22:54

solution4 2 2013-04-22 10:24:20

solution5 2 2020-12-14 10:20:36

solution6 1 2014-12-04 20:36:33

solution7 0 2022-01-26 13:43:57

solution8 0 2022-01-26 13:50:07

solution1
20 2012-12-19 13:26:46

solution2
19 ACCPTED 2012-12-01 15:25:11

solution3
7 2012-12-01 15:22:54

solution4
2 2013-04-22 10:24:20

solution5
2 2020-12-14 10:20:36

solution6
1 2014-12-04 20:36:33

solution7
0 2022-01-26 13:43:57

solution8
0 2022-01-26 13:50:07