
How to get Java to use the correct character set?

We've got our servers running on CentOS, and our Java backend sometimes has to process a file that was originally generated on a Windows machine (by one of our clients) using CP-1252. However, in 95%+ of use cases we are processing UTF-8 files.

My question: if we know that certain files will always be UTF-8, and other files will always be CP-1252, is it possible to specify in Java the character set to use for reading in each file? If so:

  • Do we need to do anything at the system level to add CP-1252 to CentOS? If so, what does this involve?
  • What Java objects would we use to apply the correct encoding on a per-file basis?

Thanks in advance!

All you need to do is specify the charset/encoding the original file was written in when constructing the reader, i.e. use the XXXReader(InputStream in, Charset cs) constructor. For example, look at InputStreamReader.
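For instance, here is a minimal sketch that reads a UTF-8 file character by character (the file name data.txt is a placeholder):

    import java.io.FileInputStream;
    import java.io.InputStreamReader;
    import java.nio.charset.StandardCharsets;

    public class ReadUtf8 {
        public static void main(String[] args) throws Exception {
            // Decode the byte stream with an explicit charset instead of
            // relying on the platform default.
            try (InputStreamReader reader = new InputStreamReader(
                    new FileInputStream("data.txt"), StandardCharsets.UTF_8)) {
                int ch;
                while ((ch = reader.read()) != -1) {
                    System.out.print((char) ch);
                }
            }
        }
    }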

My question: if we know that certain files will always be UTF-8, and other files will always be CP-1252, is it possible to specify in Java the character set to use for reading in each file?

Assuming you're in charge of the code reading the file, it should be fine. Create a FileInputStream , then wrap it in an InputStreamReader specifying the relevant character encoding.
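A sketch of that pattern, assuming a hypothetical clients.csv generated on Windows; "windows-1252" is Java's canonical name for CP-1252:

    import java.io.BufferedReader;
    import java.io.FileInputStream;
    import java.io.InputStreamReader;
    import java.nio.charset.Charset;

    public class ReadCp1252 {
        public static void main(String[] args) throws Exception {
            // FileInputStream supplies raw bytes; InputStreamReader decodes
            // them as CP-1252; BufferedReader adds line-oriented reading.
            try (BufferedReader reader = new BufferedReader(
                    new InputStreamReader(
                            new FileInputStream("clients.csv"),
                            Charset.forName("windows-1252")))) {
                String line;
                while ((line = reader.readLine()) != null) {
                    System.out.println(line);
                }
            }
        }
    }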

Do we need to do anything at the systems-level for adding CP-1252 to CentOS? If so, what does this involve?

That depends on what the JRE supports. I've never used CentOS, so I don't know whether it's likely to come with the relevant encoding as part of the JRE. You can use Charset.isSupported to check though, and Charset.availableCharsets to list what's available.
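A quick check you can run on the target JRE:

    import java.nio.charset.Charset;

    public class CharsetCheck {
        public static void main(String[] args) {
            // "windows-1252" is Java's canonical name for CP-1252.
            System.out.println("CP-1252 supported: "
                    + Charset.isSupported("windows-1252"));
            // List the canonical names of every charset this JRE supports.
            Charset.availableCharsets().keySet().forEach(System.out::println);
        }
    }

In practice, charset support comes from the JRE's own charset providers rather than the operating system, and windows-1252 ships with mainstream JREs, so a system-level change on CentOS is unlikely to be needed.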
