UTF-8 character conversion

Question

I currently have a std::string that contains this:

"\xa9 2006 FooWorld"

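Basically, it contains the © symbol. The string is passed to a method of an external API that expects UTF-8. How can I make this string UTF-8 compatible? I read here that I can use std::wstring_convert, but I am not sure how to apply it in my case. Any suggestions would be appreciated.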

Answer 1

This is simple: use a UTF-8 string literal:

u8"\u00A9 2006 FooWorld"

This yields a const char[] (a const char8_t[] as of C++20) that holds a correctly encoded UTF-8 string.
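One way to confirm what the literal actually contains is to dump its bytes. The following is a minimal sketch; `to_hex` is a hypothetical helper introduced here for illustration and is not part of the original answer. It takes a `const void*` so that it accepts both `char` and `char8_t` literals.

```cpp
#include <cstddef>
#include <string>

// Hypothetical helper (an assumption, not from the answer): formats each
// byte of a buffer as two uppercase hex digits separated by spaces.
std::string to_hex(const void* data, std::size_t size) {
    static const char digits[] = "0123456789ABCDEF";
    const unsigned char* bytes = static_cast<const unsigned char*>(data);
    std::string out;
    for (std::size_t i = 0; i < size; ++i) {
        if (i != 0) out.push_back(' ');
        out.push_back(digits[bytes[i] >> 4]);    // high nibble
        out.push_back(digits[bytes[i] & 0x0F]);  // low nibble
    }
    return out;
}
```

Calling `to_hex(u8"\u00A9", 2)` gives `C2 A9`, the two-byte UTF-8 encoding of ©, whereas `to_hex("\xa9", 1)` gives the single byte `A9` that the question's string holds.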

Answer 2

In C++11 and later, the best way to get a UTF-8 encoded string literal is to use the u8 prefix:

std::string str = u8"\u00A9 2006 FooWorld";

or:

std::string str = u8"© 2006 FooWorld";

However, you can also use std::wstring_convert (especially if your input data is not a string literal):

#include <codecvt>
#include <locale>
#include <string>

std::wstring wstr = L"© 2006 FooWorld"; // or whatever...

std::wstring_convert<std::codecvt_utf8<wchar_t>, wchar_t> convert;

std::string str = convert.to_bytes(wstr);
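If the input is already a std::string holding single-byte characters, as in the question, it can also be re-encoded by hand without std::wstring_convert (which is deprecated since C++17). The sketch below assumes the bytes are ISO-8859-1 (Latin-1), where 0xA9 is ©; the question does not state the source encoding, so this is an assumption. `latin1_to_utf8` is a hypothetical helper name, not an existing library function.

```cpp
#include <string>

// Hypothetical helper (an assumption, not from the answers): treats every
// byte of the input as an ISO-8859-1 code point and re-encodes it as UTF-8.
std::string latin1_to_utf8(const std::string& in) {
    std::string out;
    out.reserve(in.size() * 2);  // a Latin-1 byte becomes at most 2 UTF-8 bytes
    for (unsigned char c : in) {
        if (c < 0x80) {
            // ASCII range is identical in Latin-1 and UTF-8.
            out.push_back(static_cast<char>(c));
        } else {
            // U+0080..U+00FF need two bytes: 110000xx 10xxxxxx.
            out.push_back(static_cast<char>(0xC0 | (c >> 6)));
            out.push_back(static_cast<char>(0x80 | (c & 0x3F)));
        }
    }
    return out;
}
```

With this helper, `latin1_to_utf8("\xa9 2006 FooWorld")` produces the bytes `"\xc2\xa9 2006 FooWorld"`. If the data is in another encoding, such as a Windows code page with characters in the 0x80..0x9F range, this mapping does not apply and a real conversion library would be needed.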

Answer 1
1 Accepted 2018-04-05 01:24:41

Answer 2
0 2018-04-06 01:29:40