C++ åèæå
- C++11
- C++14
- C++17
- C++20
- C++ ç¼è¯å¨æ¯ææ åµè¡¨
- ç¬ç«ä¸å®¿ä¸»å®ç°
- C++ è¯è¨
- C++ å ³é®è¯
- é¢å¤çå¨
- C++ æ ååºå¤´æä»¶
- å ·åè¦æ±
- åè½ç¹æ§æµè¯ (C++20)
- å·¥å ·åº
- ç±»åæ¯æï¼åºæ¬ç±»åãRTTIãç±»åç¹æ§ï¼
- æ¦å¿µåº (C++20)
- é误å¤ç
- 卿å å管ç
- æ¥æåæ¶é´å·¥å ·
- å符串åº
- std::basic_string
- std::basic_string_view
- ç©ºç»æ¢åèå符串
- ç©ºç»æ¢å¤åèå符串
- std::mbsinit
- std::mbtowc
- std::mbstowcs
- std::btowc
- std::c8rtomb
- std::mbrtoc8
- std::mbrtowc
- std::mbsrtowcs
- std::mbrtoc16
- std::mbrtoc32
- std::mblen
- std::wctomb
- std::wcstombs
- std::wctob
- std::wcrtomb
- std::wcsrtombs
- std::c16rtomb
- std::c32rtomb
- std::mbrlen
- std::mbstate_t
- ç©ºç»æ¢å®½å符串
- std::char_traits
- 注é
- 容å¨åº
- è¿ä»£å¨åº
- èå´åº (C++20)
- ç®æ³åº
- æ°å¼åº
- è¾å ¥/è¾åºåº
- æä»¶ç³»ç»åº
- æ¬å°ååº
- æ£å表达å¼åº
- ååæä½åº
- çº¿ç¨æ¯æåº
- å®éªæ§ C++ ç¹æ§
- æç¨çèµæº
- ç´¢å¼
- std 符å·ç´¢å¼
- åç¨æ¯æ (C++20)
- C++ å ³é®è¯
ä½ç½®ï¼é¦é¡µ > C++ åèæå >åç¬¦ä¸²åº >ç©ºç»æ¢å¤åèå符串 > std::mbrtowc
std::mbrtowc
| å®ä¹äºå¤´æä»¶ <cwchar>
|
||
| std::size_t mbrtowc( wchar_t* pwc, Â Â Â Â Â Â Â Â Â Â Â const char* s, |
||
转æ¢çªå¤åèå符为宽å符ã
è¥ s 䏿¯ç©ºæéï¼åä» s ææåçåèèµ·ï¼å®¡æ¥å¤åèå符串çè³å¤ n 个åèï¼ä»¥ç¡®å®å®æä¸ä¸ªå¤åèå符ï¼å
å«ä»»ä½è¿ç§»åºåï¼çåèæ°ãè¥è¯¥å½æ°ç¡®å® s çä¸ä¸ªå¤åèåç¬¦å®æ´ä¸åæ³ï¼åå°å®è½¬æ¢ä¸ºå¯¹åºç宽å符ï¼å¹¶åå¨ä¹äº *pwc ï¼è¥ pwc é空ï¼ã
è¥ s æ¯ç©ºæéï¼åå¿½ç¥ n å pwc çå¼ï¼èè°ç¨çä»·äº std::mbrtowc(NULL, "", 1, ps) ã
è¥äº§çç宽å符æ¯ç©ºå符ï¼ååå¨äº *ps ç转æ¢ç¶æä¸ºåå§è¿ç§»ç¶æã
åæ°
| pwc | - | æåè¦åå ¥ç»æå®½å符ä½ç½®çæé |
| s | - | æåç¨ä½è¾å ¥çå¤åèå符串çæé |
| n | - | s ä¸è½è¢«æ£éªçåèæ°éå¶
|
| ps | - | æå转è¯å¤åèå符串æ¶ç转æ¢ç¶æçæé |
è¿åå¼
ä¸åé¦ä¸ªå¯åºç¨è ï¼
- è¥è½¬æ¢èª
sçå符为空å符ï¼å为 â0â ï¼è¥ pwc é空ååå¨äº*pwcï¼ - æåå°
sä»è½¬æ¢çå¤åèå符 [1...n] çåèæ° - è¥æ¥ä¸æ¥ç
nåèç»æä¸å®æ´ï¼ä½è¯¥é¨ååæ³ç宽å符ï¼å为 static_cast<std::size_t>(-2) ãä¸åå ¥ *pwc ã - è¥åºç°ç¼ç é误å为 static_cast<std::size_t>(-1) ãä¸åå
¥
*pwcï¼åå¨å¼ EILSEQ äº errno ï¼å¹¶ä¸ç½® *ps çå¼ä¸ºæªæå®ã
示ä¾
#include <iostream> #include <clocale> #include <cstring> #include <cwchar>  void print_mb(const char* ptr) { std::mbstate_t state = std::mbstate_t(); // åå§ç¶æ const char* end = ptr + std::strlen(ptr); int len; wchar_t wc; while((len = std::mbrtowc(&wc, ptr, end-ptr, &state)) > 0) { std::wcout << "Next " << len << " bytes are the character " << wc << '\n'; ptr += len; } }  int main() { std::setlocale(LC_ALL, "en_US.utf8"); // UTF-8 çªå¤åèç¼ç const char* str = u8"z\u00df\u6c34\U0001d10b"; // æ u8"zÃæ°´????" // æ "\x7a\xc3\x9f\xe6\xb0\xb4\xf0\x9d\x84\x8b"; print_mb(str); }
è¾åºï¼
Next 1 bytes are the character z Next 2 bytes are the character à Next 3 bytes are the character 水 Next 4 bytes are the character ????
åé
| å°ä¸ä¸ä¸ªå¤åèåç¬¦è½¬æ¢æå®½å符 (彿°) | |
| ç»å®ç¶æï¼è½¬æ¢å®½å符å°å
¶å¤åè表示 (彿°) | |
| [è] |
ä» externT 转æ¢å符串为 internT ï¼å¦å¨ä»æä»¶è¯»åæ¶ ( std::codecvt<InternT,ExternT,State> çèåä¿æ¤æå彿°) |