
Linux vs. Windows: How does the console render unicode characters?

Original 2009-08-07 22:13:06 5 3 windows/ linux/ unicode/ encoding

This is quite a low-level (low in the sense of "closer to the metal") question.

Could any of you point me to documentation or explanations of how a console, upon receiving a Unicode character (or any character code, though I'm particularly interested in the Unicode Standard), looks up the corresponding glyph, and where it looks? I mean the console in Windows, good ol' cmd.exe (using, say, codepage 65001), and xterm in Linux started with, say, LC_CTYPE=en_US.UTF-8.

I know this may be harder to find out for Windows, but I can't really find much information.

Thank you.

3 answers

As far as I can tell, cmd.exe is bound to whatever legacy code page you selected under "Language for non-Unicode programs" (I believe that is what the setting is called).

To elaborate: if I set that setting to Japanese, cmd.exe suddenly replaces backslashes with yen signs (as does every other non-Unicode app on the system) and correctly interprets ShiftJIS codes, for example. Setting it to Dutch gives me an accented I (I forget which one), while another codepage gives a half-filled vertical block for the same character code.

Not Unicode. Unicode would let me do all three at the same time.
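The effect described above, where one character code shows up as different characters depending on the active code page, can be reproduced outside the console. The following is only a sketch using Python's built-in codecs; the byte value 0xDE and the three code pages are arbitrary choices for illustration, and `decode_under` is a hypothetical helper, not part of cmd.exe or any Windows API.

```python
import unicodedata

# One raw byte value (an arbitrary choice for this illustration).
RAW = bytes([0xDE])

def decode_under(codepages, raw=RAW):
    """Return {codepage: character} for the same raw byte."""
    return {cp: raw.decode(cp) for cp in codepages}

glyphs = decode_under(["cp437", "cp850", "cp1252"])

# Print code points and names only, so the output does not itself
# depend on what the current terminal can display.
for cp, ch in glyphs.items():
    print(f"{cp}: U+{ord(ch):04X} {unicodedata.name(ch)}")

# In Unicode each of these characters has its own code point,
# so a single UTF-8 string can hold all of them at once.
together = "".join(glyphs.values())
utf8_bytes = together.encode("utf-8")
print(len(together), "characters ->", len(utf8_bytes), "UTF-8 bytes")
```

The same byte maps to a different character under each code page, whereas the UTF-8 string at the end carries all of them together, which is the "all three at the same time" point made above.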

The console uses a TextWriter with an encoding created from the codepage. That means that the characters written are encoded into bytes using the specific Encoding object for the codepage.
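As a rough illustration of "encoded into bytes using the specific Encoding object for the codepage", here is a sketch in Python rather than .NET. The function `encode_for_console` is a hypothetical name, and replacing unencodable characters with `?` is an assumption of this sketch, not a statement about what the real console writer does.

```python
def encode_for_console(text: str, codepage: str) -> bytes:
    """Encode text as a writer bound to `codepage` might.

    Characters that the code page cannot represent are replaced
    with '?' (an assumption made for this sketch).
    """
    return text.encode(codepage, errors="replace")

# The same text becomes different bytes under different encodings.
sample = "caf\u00e9"  # "café"
for cp in ("cp437", "cp1252", "utf-8"):
    print(cp, encode_for_console(sample, cp).hex())

# A character outside the code page is lost on the way out.
print(encode_for_console("\u65e5", "cp437"))  # U+65E5, a CJK ideograph
```

Whatever reads those bytes afterwards has to assume the same code page to get the original characters back.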

The console doesn't support Unicode. :)


The technical posts on this site are licensed under CC BY-SA 4.0. If you reprint them, please cite the site URL or the original address. For any questions, please contact: yoyou2525@163.com.


solution1
3 ACCEPTED 2009-08-07 22:34:22

solution2
1 2009-08-07 22:38:45

solution3
1 2009-08-08 12:49:47
