C++ åèæå
- C++11
- C++14
- C++17
- C++20
- C++ ç¼è¯å¨æ¯ææ åµè¡¨
- ç¬ç«ä¸å®¿ä¸»å®ç°
- C++ è¯è¨
- C++ å ³é®è¯
- é¢å¤çå¨
- C++ æ ååºå¤´æä»¶
- å ·åè¦æ±
- åè½ç¹æ§æµè¯ (C++20)
- å·¥å ·åº
- ç±»åæ¯æï¼åºæ¬ç±»åãRTTIãç±»åç¹æ§ï¼
- æ¦å¿µåº (C++20)
- é误å¤ç
- 卿å å管ç
- æ¥æåæ¶é´å·¥å ·
- å符串åº
- std::basic_string
- std::basic_string_view
- ç©ºç»æ¢åèå符串
- ç©ºç»æ¢å¤åèå符串
- std::mbsinit
- std::mbtowc
- std::mbstowcs
- std::btowc
- std::c8rtomb
- std::mbrtoc8
- std::mbrtowc
- std::mbsrtowcs
- std::mbrtoc16
- std::mbrtoc32
- std::mblen
- std::wctomb
- std::wcstombs
- std::wctob
- std::wcrtomb
- std::wcsrtombs
- std::c16rtomb
- std::c32rtomb
- std::mbrlen
- std::mbstate_t
- ç©ºç»æ¢å®½å符串
- std::char_traits
- 注é
- 容å¨åº
- è¿ä»£å¨åº
- èå´åº (C++20)
- ç®æ³åº
- æ°å¼åº
- è¾å ¥/è¾åºåº
- æä»¶ç³»ç»åº
- æ¬å°ååº
- æ£å表达å¼åº
- ååæä½åº
- çº¿ç¨æ¯æåº
- å®éªæ§ C++ ç¹æ§
- æç¨çèµæº
- ç´¢å¼
- std 符å·ç´¢å¼
- åç¨æ¯æ (C++20)
- C++ å ³é®è¯
ä½ç½®ï¼é¦é¡µ > C++ åèæå >åç¬¦ä¸²åº >ç©ºç»æ¢å¤åèå符串 > std::mblen
std::mblen
| å®ä¹äºå¤´æä»¶ <cstdlib>
|
||
| int mblen( const char* s, std::size_t n ); |
||
ç¡®å®å¤åèå符çåè大å°ï¼å
¶é¦åè为 s ææåã
è¥ s æ¯ç©ºæéï¼åéç½®å
¨å±è½¬æ¢ç¶æå¹¶ç¡®å®æ¯å¦ä½¿ç¨è¿ç§»åºåã
æ¤å½æ°çä»·äºè°ç¨ std::mbtowc((wchar_t*)0, s, n) ï¼é¤äºä¸å½±å std::mbtowc ç转æ¢ç¶æã
注æ
æ¯æ¬¡å¯¹ mblen çè°ç¨æ´æ°å
é¨å
¨å±è½¬æ¢ç¶æï¼ std::mbstate_t ç±»åçéæå¯¹è±¡ï¼åªä¸ºæ¤å½æ°æç¥ï¼ãè¥å¤åèç¼ç 使ç¨è¿ç§»ç¶æï¼åå¿
é¡»çæä»¥é¿å
忤æå¤æ¬¡æ«æã任使
åµä¸ï¼å¤çº¿ç¨ä¸åºæ 忥å°è°ç¨ mblen ï¼å¯ç¨ std::mbrlen 代æ¿ã
åæ°
| s | - | æåå¤åèå符çæé |
| n | - | sä¸è½è¢«æ£éªçåèæ°éå¶
|
è¿åå¼
è¥ s 䏿¯ç©ºæéï¼åè¿åå¤åèå符æå«çåèæ°ï¼æè¥ s ææçé¦åèä¸ç»æåæ³å¤åèå符åè¿å -1 ï¼æè¥ s æå空å符 '\0' åè¿å â0â ã
è¥ s æ¯ç©ºæéï¼åéç½®å
é¨è½¬æ¢ç¶æä¸ºåå§è¿ç§»ç¶æï¼å¹¶è¥å½åå¤åèç¼ç éç¶æä¾èµï¼ä¸ä½¿ç¨è¿ç§»åºåï¼åè¿å â0â ï¼æè
è¥å½åå¤åèç¼ç ä¸ºç¶æä¾èµï¼ä½¿ç¨è¿ç§»åºåï¼åè¿åéé¶ã
示ä¾
#include <clocale> #include <string> #include <iostream> #include <cstdlib> #include <stdexcept>  // å¤åèå符串ä¸çåç¬¦æ°æ¯ mblen() çå // 注æï¼æ´ç®åçææ®µæ¯ std::mbstowcs(NULL, s.c_str(), s.size()) std::size_t strlen_mb(const std::string& s) { std::size_t result = 0; const char* ptr = s.data(); const char* end = ptr + s.size(); std::mblen(NULL, 0); // é置转æ¢ç¶æ while (ptr < end) { int next = std::mblen(ptr, end-ptr); if (next == -1) { throw std::runtime_error("strlen_mb(): conversion error"); } ptr += next; ++result; } return result; }  int main() { // å 许 mblen() 以 UTF-8 å¤åèç¼ç å·¥ä½ std::setlocale(LC_ALL, "en_US.utf8"); // UTF-8 çªå¤åèç¼ç std::string str = u8"z\u00df\u6c34\U0001f34c"; // or u8"zÃæ°´????" std::cout << str << " is " << str.size() << " bytes, but only " << strlen_mb(str) << " characters\n"; }
è¾åºï¼
zÃæ°´???? is 10 bytes, but only 4 characters
åé
| å°ä¸ä¸ä¸ªå¤åèåç¬¦è½¬æ¢æå®½å符 (彿°) | |
| ç»å®ç¶æï¼è¿åä¸ä¸ä¸ªå¤åèå符ä¸çåèæ° (彿°) |