The description on MSDN
A multibyte character is a character composed of sequences of one or more bytes. Each byte sequence represents a single character in the extended character set. Multibyte characters are used in character sets such as Kanji.
Wide characters are multilingual character codes that are always 16 bits wide. The type for character constants is char; for wide characters, the type is wchar_t. Since wide characters are always a fixed size, using wide characters simplifies programming with international character sets.
The wide-character-string literal L"hello" becomes an array of six integers of type wchar_t.
The Unicode specification is the specification for wide characters. The run-time library routines for translating between multibyte and wide characters include mbstowcs, mbtowc, wcstombs, and wctomb.
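The two claims in the excerpt above (that L"hello" occupies six wchar_t elements, and that mbstowcs translates multibyte text to wide characters) can be checked with a minimal sketch. The helper names hello_literal_length and to_wide are hypothetical and were chosen only for this illustration; they are not part of the run-time library.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>
#include <wchar.h>

/* Number of wchar_t elements in the literal L"hello":
   five letters plus the terminating L'\0'. */
static size_t hello_literal_length(void) {
    return sizeof(L"hello") / sizeof(wchar_t);
}

/* Hypothetical helper: convert a multibyte string to a wide string.
   Returns the number of wide characters written (terminator excluded),
   or (size_t)-1 if the input holds an invalid multibyte sequence. */
static size_t to_wide(const char *mb, wchar_t *out, size_t out_cap) {
    assert(mb != NULL && out != NULL && out_cap > 0);
    /* Leave one slot free so the result can always be terminated. */
    size_t n = mbstowcs(out, mb, out_cap - 1);
    if (n == (size_t)-1) {
        return n;
    }
    out[n] = L'\0';
    return n;
}
```

Note that mbstowcs interprets its input according to the current locale. Plain ASCII converts in the default "C" locale, but text such as Chinese would presumably need a prior call like setlocale(LC_ALL, "") with a matching system locale, which this sketch leaves to the caller.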
Reference
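Multibyte characters are characters represented by one or more bytes each.
Multibyte characters (MBCS), as Microsoft defines them, sit between single-byte ANSI characters and Unicode. Every byte value up to 127 (0x7F) stands for itself in a single byte; for example, 'A' is simply 0x41. A byte value above 127 marks a lead byte, which combines with the byte that follows it to represent one character, such as a Chinese character. Because 0x00 never appears as part of a character, the string terminator needs only a single 0x00 byte.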
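The lead-byte rule described above can be sketched as a character counter. This is a simplified illustration that follows the rule exactly as stated (any byte above 0x7F is treated as a lead byte); real code pages define narrower lead-byte ranges, and on Windows a function such as IsDBCSLeadByte would be the safer test. The name dbcs_char_count is hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper: count characters (not bytes) in a double-byte
   string. Assumption: every byte above 0x7F is a lead byte that pairs
   with the next byte, as the text describes. */
static size_t dbcs_char_count(const char *s) {
    assert(s != NULL);
    const unsigned char *p = (const unsigned char *)s;
    size_t count = 0;
    while (*p != 0x00) {              /* a single 0x00 byte ends the string */
        if (*p > 0x7F && p[1] != 0x00) {
            p += 2;                   /* lead byte plus its trailing byte */
        } else {
            p += 1;                   /* single-byte character, or a
                                         truncated lead byte at the end */
        }
        count++;
    }
    return count;
}
```

For example, the bytes 41 D6 D0 CE C4 (the letter A followed by two Chinese characters in GBK) occupy five bytes but count as three characters, which is why strlen alone cannot report the character count of a multibyte string.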
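This article introduces the concepts of wide characters and multibyte characters and how they are used in programming. Wide characters use a fixed size (16 bits) to represent multilingual character sets, which simplifies programming with international character sets. Multibyte characters use sequences of one or more bytes to represent the characters of an extended character set, and suit large character sets such as Chinese characters.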





