
How to detect an appropriate String locale in Java

Original post 2015-11-27 17:08:34 8 2 java/ unicode/ locale

In my current project I need to lowercase incoming text, which may be in English, German, or Turkish. A plain String#toLowerCase() fails for some characters of the Turkish alphabet because, for example, the non-ASCII character http://unicode-table.com/en/0130/ (U+0130, İ) must be mapped to the ASCII character http://unicode-table.com/en/0069/ (U+0069, i). Java 7 handles this mapping without any issues if I provide the locale, i.e. str.toLowerCase(new Locale("tr")). But in that case it looks like I have to detect the appropriate locale of the given text, because it could be written in any of the three languages.
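To make the problem concrete, here is a minimal sketch of how the locale argument changes the result. The class and method names are made up for illustration, and Locale.forLanguageTag("tr") is used as an equivalent of new Locale("tr"); it has been available since Java 7.

```java
import java.util.Locale;

// Hypothetical demo class; the names are illustrative only.
final class LowercaseLocaleDemo {
    // Turkish locale, equivalent to new Locale("tr").
    static final Locale TURKISH = Locale.forLanguageTag("tr");

    // Lowercases using Turkish casing rules.
    static String lowerTurkish(String s) {
        return s.toLowerCase(TURKISH);
    }

    // Lowercases using locale-neutral casing rules.
    static String lowerRoot(String s) {
        return s.toLowerCase(Locale.ROOT);
    }

    public static void main(String[] args) {
        String dottedCapitalI = "\u0130"; // U+0130, capital I with dot above

        // Turkish rules map U+0130 to the plain ASCII 'i' (U+0069).
        System.out.println(lowerTurkish(dottedCapitalI).equals("i"));

        // Locale-neutral rules yield 'i' followed by U+0307 (combining dot above),
        // so the result is two chars long instead of one.
        System.out.println(lowerRoot(dottedCapitalI).length());

        // The flip side: Turkish rules map ASCII 'I' to dotless U+0131,
        // which would be wrong for English or German text.
        System.out.println(lowerTurkish("I").equals("\u0131"));
    }
}
```

The last case is the reason the Turkish locale cannot simply be applied to every input: it also changes how the ASCII letter I is lowercased.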

Is there any way to perform the appropriate locale detection, or is this approach wrong?

EDIT 1

I didn't mention the actual use case: I'm adding tags to an entity via a REST API, and I guess I'm not allowed to change the API contract.

2 answers

There are libraries that use heuristics to detect a language with a certain probability. An example can be found here.

There is probably a library that does this, but I don't know of one. I can, however, offer you a simple solution.

Turkish and German each have a few special characters. All other characters are plain English letters, so the problem does not arise for them. You can therefore keep a list of the special German and Turkish characters and detect the locale of the current string by searching it for these characters. If one of the Turkish characters is found, process the string in the Turkish locale; the same goes for German. If none of the special characters is found, use the default locale.

This solution has a performance penalty because you scan the string twice, but that is not important for most applications.
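The character-list idea could be sketched roughly as follows. This is only an illustration under stated assumptions: the class name, the two character sets, and the choice of Locale.ENGLISH as the fallback are mine, not part of the answer. The letters ö and ü are left out of both sets because German and Turkish share them, so they cannot decide between the two.

```java
import java.util.Locale;

// Hypothetical helper; names and character sets are assumptions for illustration.
final class TagLocaleGuesser {
    static final Locale TURKISH = Locale.forLanguageTag("tr");

    // Letters used in Turkish but not in German:
    // g with breve, dotless i, dotted capital I, s with cedilla, c with cedilla.
    private static final String TURKISH_ONLY =
            "\u011F\u011E\u0131\u0130\u015F\u015E\u00E7\u00C7";

    // Letters used in German but not in Turkish: a with diaeresis, sharp s.
    private static final String GERMAN_ONLY = "\u00E4\u00C4\u00DF\u1E9E";

    // Guesses a locale in a single pass over the text.
    static Locale guessLocale(String text) {
        boolean sawGerman = false;
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (TURKISH_ONLY.indexOf(c) >= 0) {
                return TURKISH; // a Turkish-only letter settles it immediately
            }
            if (GERMAN_ONLY.indexOf(c) >= 0) {
                sawGerman = true;
            }
        }
        // Assumed fallback: treat everything else as English.
        return sawGerman ? Locale.GERMAN : Locale.ENGLISH;
    }

    // Lowercases the text with the guessed locale.
    static String lowerCase(String text) {
        return text.toLowerCase(guessLocale(text));
    }

    public static void main(String[] args) {
        System.out.println(guessLocale("\u0130STANBUL").getLanguage());
        System.out.println(lowerCase("\u0130STANBUL"));
        System.out.println(guessLocale("Stra\u00DFe").getLanguage());
        System.out.println(lowerCase("TITLE"));
    }
}
```

One limitation to keep in mind: a Turkish word written entirely with ASCII letters contains none of the marker characters, so this sketch falls back to English and lowercases a capital I to a dotted i rather than the dotless ı.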


The technical posts on this site are licensed under CC BY-SA 4.0. If you reprint them, please indicate the site URL or the original address. For any questions, please contact: yoyou2525@163.com.
