
Convert unicode representations on incoming string to UTF-8?

I'm reading some data that has already been converted to HTML-style &#965; code.

I now need to convert this back to UTF-8 characters for viewing. Unfortunately I can't use a browser to view the string.

I've read around about conversion in Java, and it seems that if you have a string literal containing \uxxxx, the compiler will convert it for you. However, that won't work here, of course, because I want to read in dynamic strings.

So can this be done?

Many thanks! Dan

You need to use:

String StringEscapeUtils.unescapeJava(String str)

from the Apache Commons Lang library.

It finds \uxxxx sequences in the input string and replaces each one with the character it denotes, returning an ordinary Java String.
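If adding the library is not an option, the same idea can be sketched with the standard library alone. This is only a rough illustration, not the Commons implementation: the class name UnicodeUnescape and the method unescape are hypothetical, and unlike unescapeJava it ignores other escapes such as \n and does not treat a doubled backslash specially. The main method also shows how to obtain UTF-8 bytes from the decoded String once they are actually needed.

```java
import java.nio.charset.StandardCharsets;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration; not part of Apache Commons.
class UnicodeUnescape {
    // Matches a backslash, one or more 'u' characters, then exactly four hex digits.
    private static final Pattern ESCAPE = Pattern.compile("\\\\u+([0-9a-fA-F]{4})");

    static String unescape(String input) {
        Matcher m = ESCAPE.matcher(input);
        StringBuilder out = new StringBuilder();
        int last = 0;
        while (m.find()) {
            // Copy the text before the escape unchanged.
            out.append(input, last, m.start());
            // Each escape is one UTF-16 code unit; surrogate pairs arrive as two escapes.
            out.append((char) Integer.parseInt(m.group(1), 16));
            last = m.end();
        }
        out.append(input, last, input.length());
        return out.toString();
    }

    public static void main(String[] args) {
        // The doubled backslash keeps the escape literal, as it would be in data read at runtime.
        String decoded = unescape("caf\\u00e9");
        // A Java String is UTF-16 internally; encode explicitly when UTF-8 bytes are required.
        byte[] utf8 = decoded.getBytes(StandardCharsets.UTF_8);
        System.out.println(decoded + " uses " + utf8.length + " UTF-8 bytes");
    }
}
```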

native2ascii

Use the "-reverse" option, which converts \uxxxx escapes back into characters in the native encoding.

converted to html style &#965; code.

Which of those? \uXXXX is a Java convention and has nothing to do with HTML. HTML entities look like &#175;.
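If the data turns out to contain HTML numeric character references rather than Java escapes, decoding them could look roughly like the sketch below. The class name NumericEntityDecoder and the method decode are hypothetical, and the sketch only covers the numeric forms (decimal and hexadecimal); named entities such as &amp; are left untouched. Commons Lang 3 also offers StringEscapeUtils.unescapeHtml4, which is likely the better choice when the library is available.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration; handles numeric references only.
class NumericEntityDecoder {
    // Group 1: hexadecimal digits after "&#x"; group 2: decimal digits after "&#".
    private static final Pattern REF =
            Pattern.compile("&#(?:[xX]([0-9a-fA-F]{1,6})|([0-9]{1,7}));");

    static String decode(String input) {
        Matcher m = REF.matcher(input);
        StringBuilder out = new StringBuilder();
        int last = 0;
        while (m.find()) {
            int codePoint = m.group(1) != null
                    ? Integer.parseInt(m.group(1), 16)
                    : Integer.parseInt(m.group(2), 10);
            out.append(input, last, m.start());
            if (Character.isValidCodePoint(codePoint)) {
                // appendCodePoint emits a surrogate pair when the value is above U+FFFF.
                out.appendCodePoint(codePoint);
            } else {
                // Leave out-of-range references exactly as they were.
                out.append(m.group());
            }
            last = m.end();
        }
        out.append(input, last, input.length());
        return out.toString();
    }
}
```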

