如何将 UnicodeString 复制到 C++Builder Android 应用程序中的 wchar_t 数组？

Question

I am using C++Builder 10.3 Rio developing a multi-platform app for Android.我正在使用 C++Builder 10.3 Rio 为 Android 开发多平台应用程序。

I have an array of data as follows:我有一个数据数组如下：

typedef struct recordstruct
{
    bool shop;
    bool bought;
    wchar_t description[80];
} recordtype;

recordtype MasterItems[MAXITEMS]=
{
    false,false,L"Apples",
    false,false,L"Apricots",
    false,false,L"Avocado",
...
...
};

I've copied this into a TEdit , and want to get the value back to the MasterItems array.我已将其复制到TEdit中，并希望将值返回到MasterItems数组。

I used to use c_str() and mbstowcs() and strcpy() / wcscpy() etc.我曾经使用c_str()和mbstowcs()和strcpy() / wcscpy()等。

How can I do this please?请问我该怎么做？

Answer 1

UnicodeString is a UTF-16 encoded string on all platforms. UnicodeString是所有平台上的 UTF-16 编码字符串。 However, wchar_t is a 16bit type used for UTF-16 data only on Windows.但是， wchar_t是 16 位类型，仅用于 Windows 上的 UTF-16 数据。 On other platforms, wchar_t is a 32bit type used for UTF-32 data.在其他平台上， wchar_t是用于 UTF-32 数据的 32 位类型。

This is documented in Embarcadero's DocWiki:这记录在 Embarcadero 的 DocWiki 中：

String Literals char16_t and wchar_t on macOS and iOS macOS 和 iOS 上的字符串文字 char16_t 和 wchar_t
(Android is included, too) （也包括Android）

On macOS and iOS, char16_t is not equivalent to wchar_t (as it is on Windows):在 macOS 和 iOS 上， char16_t不等同于wchar_t （就像在 Windows 上一样）：

On Windows, wchar_t and char16_t are both double-byte characters.在 Windows 上， wchar_t和char16_t都是双字节字符。

On macOS , iOS , and Android , however, a wchar_t is a 4-byte character.但是，在macOS 、 iOS和 Android上， wchar_t是一个 4 字节字符。

So, to declare UTF-16 constant strings, on Windows use either the L or the u prefix, whereas on macOS, iOS, and Android , use the u prefix.因此，要声明 UTF-16 常量字符串，在 Windows 上使用L或u前缀，而在 macOS、iOS和 Android上使用u前缀。

Example on Windows: Windows 示例：

UnicodeString(L"Text"), UnicodeString(u"Text")

Example on macOS, iOS, and Android : macOS、iOS和 Android上的示例：

UnicodeString(u"Text")

Using the L prefix for string literals on macOS, iOS, and Android is not, however, wrong.但是，在 macOS、iOS和 Android上对字符串文字使用L前缀并没有错。 In this case, UTF-32 constant strings are converted to UTF-16 strings.在这种情况下，UTF-32 常量字符串被转换为 UTF-16 字符串。

For portability, use the _D macro to write constant strings that are prefixed accordingly with L or u .为了可移植性，使用_D宏来编写相应地以L或u为前缀的常量字符串。 Example:例子：

UnicodeString(_D("Text"))

To ensure UTF-16 is used on all platforms, the System::WideChar type is an alias for wchar_t on Windows and char16_t on other platforms.为确保在所有平台上使用 UTF-16， System::WideChar类型是 Windows 上的wchar_t和其他平台上的char16_t的别名。 UnicodeString is a container of WideChar elements. UnicodeString是WideChar元素的容器。

So, if you use wchar_t for your array, then on non-Windows platforms you will need to first convert your UnicodeString to UTF-32 at runtime, such as with the RTL's UnicodeStringToUCS4String() function, before you can then copy that data into your array, eg:因此，如果您对数组使用wchar_t ，那么在非 Windows 平台上，您需要先在运行时将UnicodeString转换为 UTF-32，例如使用 RTL 的UnicodeStringToUCS4String() function，然后才能将该数据复制到您的数组，例如：

typedef struct recordstruct
{
    bool shop;
    bool bought;
    wchar_t description[80];
} recordtype;

recordtype MasterItems[MAXITEMS]=
{
    false,false,L"Apples",
    false,false,L"Apricots",
    false,false,L"Avocado",
    ...
};

...

#if defined(WIDECHAR_IS_WCHAR) // WideChar = wchar_t = 2 bytes

StrLCopy(MasterItems[index].description, Edit1->Text.c_str(), std::size(MasterItems[index].description)-1); // -1 for null terminator

/* or:
UnicodeString s = Edit1->Text;
size_t len = std::min(s.Length(), std::size(MasterItems[index].destination)-1); // -1 for null terminator
std::copy_n(s.c_str(), len, MasterItems[index].destination);
MasterItems[index].destination[len] = L'\0';
*/

#elif defined(WIDECHAR_IS_CHAR16) // WideChar = char16_t, wchar_t = 4 bytes

UCS4String s = UnicodeStringToUCS4String(Edit1->Text);
size_t len = std::min(s.Length-1, std::size(MasterItems[index].destination)-1); // UCS4String::Length includes the null terminator!
std::copy_n(&s[0], len, MasterItems[index].destination);
MasterItems[index].destination[len] = L'\0';

#else

// unsupported wchar_t size!

#endif

Otherwise, if you want to ensure your array is always 16bit UTF-16 on all platforms, then you need to use char16_t or WideChar instead of wchar_t in your array.否则，如果您想确保您的数组在所有平台上始终是 16 位 UTF-16，那么您需要在数组中使用char16_t或WideChar而不是wchar_t 。 The u prefix creates a char16_t -based literal, and the RTL's _D() macro creates a WideChar -based literal (using L or u according to platform), eg: u前缀创建基于char16_t的文字，而 RTL 的_D()宏创建基于WideChar的文字（根据平台使用L或u ），例如：

typedef struct recordstruct
{
    bool shop;
    bool bought;
    char16_t description[80]; // or: System::WideChar
} recordtype;

recordtype MasterItems[MAXITEMS]=
{
    false,false,u"Apples", // or: _D("Apples")
    false,false,u"Apricots", // or: _D("Apricots")
    false,false,u"Avocado", // or: _D("Avocado")
    ...
};

...

StrLCopy(MasterItems[index].description, Edit1->Text.c_str(), std::size(MasterItems[index].description)-1); // -1 for null terminator

/* or:
UnicodeString s = Edit1->Text;
size_t len = std::min(s.Length(), std::size(MasterItems[index].description)-1); // -1 for null terminator
std::copy_n(s.c_str(), len, MasterItems[index].description);
MasterItems[index].description[len] = u'\0'; // or: _D('\0')
*/

如何将 UnicodeString 复制到 C++Builder Android 应用程序中的 wchar_t 数组？

问题描述

1 个解决方案

解决方案1
1 已采纳 2020-06-03 19:15:22

如何将 UnicodeString 复制到 C++Builder Android 应用程序中的 wchar_t 数组？

问题描述

1 个解决方案

解决方案1 1 已采纳 2020-06-03 19:15:22

解决方案1
1 已采纳 2020-06-03 19:15:22