Manipulating strings of multibyte characters

Original post 2014-01-16 07:38:22 8 2 c++/ c/ string

I am a novice C programmer. I am trying to write a C program that sometimes deals with English text (which fits into 8-bit chars) and sometimes with Japanese text (which needs 16 bits per character).

If I use the same code to manipulate text in either language, do I need to set aside 16 bits for every character, even for the English text?

What are some of the ways of encoding multibyte characters?

What if the compiler can't store multibyte strings compactly?

I'm confused; please help me out. Kindly support your answers with code examples. Please also explain this in the context of C++, as I am learning C++ as well and have beginner-level experience in it.

Thanks in advance.

This was an interview question asked of one of my acquaintances a few days ago.

2 answers

In C++ you can use std::wstring, which uses wchar_t as the underlying character type. Since C++11 you can also use std::u16string or std::u32string, depending on how much storage per character you need.

C also has wchar_t, defined in <wchar.h>.

Okay, after doing a little bit of research, I think I got an answer:

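mbstowcs ("multibyte string to wide character string") and wcstombs ("wide character string to multibyte string") convert between arrays of wchar_t (in which every character takes the same fixed width: 16 bits on Windows, typically 32 bits on Linux and other Unix-like systems) and multibyte strings (in which individual characters are stored in one byte where possible and in several bytes otherwise). Both functions are declared in <stdlib.h> and interpret the multibyte string according to the current locale's encoding.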

How to create multibyte characters in C

Manipulating strings c++

Test if char* string contains multibyte characters

Using wide strings with ifstream::open or multibyte strings with CreateProcess

Manipulating C-style strings

Manipulating C-style strings?

Transparently manipulating strings inserted into an ostream

C/C++ isspace() skipping multibyte string characters

Converting a string of multibyte characters to widechar's gives unexpected results

Can i use memcmp two compare multibyte characters string?

The technical posts on this site follow the CC BY-SA 4.0 license. If you reprint them, please indicate the site URL or the original address. For any questions, please contact: yoyou2525@163.com.
