Null-terminated multibyte strings - [C Language Development Manual] - Online Native Manual


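A null-terminated multibyte string (NTMBS), or "multibyte string", is a sequence of nonzero bytes followed by a byte with value zero (the terminating null character).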

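Each character stored in the string may occupy more than one byte. The encoding used to represent characters in a multibyte string is locale-specific: it may be UTF-8, GB18030, EUC-JP, Shift-JIS, and so on. For example, the char array {'\xe4', '\xbd', '\xa0', '\xe5', '\xa5', '\xbd', '\0'} is an NTMBS holding the string "你好" in UTF-8 multibyte encoding: the first three bytes encode the character 你, and the next three bytes encode the character 好. The same string encoded in GB18030 is the char array {'\xc4', '\xe3', '\xba', '\xc3', '\0'}, where each of the two characters is encoded as a two-byte sequence.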

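In some multibyte encodings, a given multibyte sequence may represent different characters depending on earlier byte sequences, known as "shift sequences". Such encodings are called state-dependent: the current shift state must be known in order to interpret each character. An NTMBS is valid only if it begins and ends in the initial shift state: if a shift sequence was used, the corresponding unshift sequence must be present before the terminating null character. Examples of such encodings are BOCU-1 and SCSU.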

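A multibyte string is layout-compatible with a null-terminated byte string (NTBS); that is, it can be stored, copied, and examined using the same facilities, except for calculating the number of characters. If the correct locale is in effect, the I/O functions also handle multibyte strings. Multibyte strings can be converted to and from wide strings using the following locale-dependent conversion functions: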

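Multibyte/wide character conversions

| Defined in header <stdlib.h> | |
|:----|:----|
| mblen | returns the number of bytes in the next multibyte character (function) |
| mbtowc | converts the next multibyte character to a wide character (function) |
| wctomb, wctomb_s (C11) | converts a wide character to its multibyte representation (function) |
| mbstowcs, mbstowcs_s (C11) | converts a narrow multibyte string to a wide string (function) |
| wcstombs, wcstombs_s (C11) | converts a wide string to a narrow multibyte string (function) |

| Defined in header <wchar.h> | |
|:----|:----|
| mbsinit (C95) | checks if the mbstate_t object represents the initial shift state (function) |
| btowc (C95) | widens a single-byte narrow character to a wide character, if possible (function) |
| wctob (C95) | narrows a wide character to a single-byte narrow character, if possible (function) |
| mbrlen (C95) | returns the number of bytes in the next multibyte character, given state (function) |
| mbrtowc (C95) | converts the next multibyte character to a wide character, given state (function) |
| wcrtomb (C95), wcrtomb_s (C11) | converts a wide character to its multibyte representation, given state (function) |
| mbsrtowcs (C95), mbsrtowcs_s (C11) | converts a narrow multibyte string to a wide string, given state (function) |
| wcsrtombs (C95), wcsrtombs_s (C11) | converts a wide string to a narrow multibyte string, given state (function) |

| Defined in header <uchar.h> | |
|:----|:----|
| mbrtoc16 (C11) | generates the next 16-bit wide character from a narrow multibyte string (function) |
| c16rtomb (C11) | converts a 16-bit wide character to a narrow multibyte string (function) |
| mbrtoc32 (C11) | generates the next 32-bit wide character from a narrow multibyte string (function) |
| c32rtomb (C11) | converts a 32-bit wide character to a narrow multibyte string (function) |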

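Types

| Defined in header <wchar.h> | |
|:----|:----|
| mbstate_t (C95) | conversion state information necessary to iterate over multibyte strings (type) |

| Defined in header <uchar.h> | |
|:----|:----|
| char16_t (C11) | 16-bit wide character type (typedef) |
| char32_t (C11) | 32-bit wide character type (typedef) |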

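Macros

| Defined in header <limits.h> | |
|:----|:----|
| MB_LEN_MAX | maximum number of bytes in a multibyte character, for any supported locale (macro constant) |

| Defined in header <stdlib.h> | |
|:----|:----|
| MB_CUR_MAX | maximum number of bytes in a multibyte character in the current locale (macro variable) |

| Defined in header <uchar.h> | |
|:----|:----|
| __STDC_UTF_16__ (C11) | indicates that UTF-16 encoding is used by mbrtoc16 and c16rtomb (macro constant) |
| __STDC_UTF_32__ (C11) | indicates that UTF-32 encoding is used by mbrtoc32 and c32rtomb (macro constant) |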

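References

C11 standard (ISO/IEC 9899:2011):

7.10 Sizes of integer types <limits.h> (p: 222)
7.22 General utilities <stdlib.h> (p: 340-360)
7.28 Unicode utilities <uchar.h> (p: 398-401)
7.29 Extended multibyte and wide character utilities <wchar.h> (p: 402-446)
7.31.12 General utilities <stdlib.h> (p: 456)
7.31.16 Extended multibyte and wide character utilities <wchar.h> (p: 456)
K.3.6 General utilities <stdlib.h> (p: 604-614)
K.3.9 Extended multibyte and wide character utilities <wchar.h> (p: 627-651)

C99 standard (ISO/IEC 9899:1999):

7.10 Sizes of integer types <limits.h> (p: 203)
7.20 General utilities <stdlib.h> (p: 306-324)
7.24 Extended multibyte and wide character utilities <wchar.h> (p: 348-392)
7.26.10 General utilities <stdlib.h> (p: 402)
7.26.12 Extended multibyte and wide character utilities <wchar.h> (p: 402)

C89/C90 standard (ISO/IEC 9899:1990):

4.1.4 Limits <float.h> and <limits.h>
4.10 General utilities <stdlib.h>
4.13.7 General utilities <stdlib.h>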

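See also

| C++ documentation for Null-terminated multibyte strings |
|:----|

This document is jointly maintained by members of the Tencent Cloud Yun+ community. For questions, contact yunjia_community@tencent.com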

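iswdigit

Defined in header <wctype.h>
int iswdigit( wint_t ch );		(since C95)

Checks if the given wide character corresponds (if narrowed) to one of the ten decimal digit characters 0123456789.

Parameters

ch	-	wide character

Return value

Non-zero value if the wide character is a numeric character, zero otherwise.

Notes

iswdigit and iswxdigit are the only standard wide character classification functions that are not affected by the currently installed C locale.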

Example

Some locales offer additional character classes that detect non-ASCII digits.

#include <stdio.h>
#include <wctype.h>
#include <wchar.h>
#include <locale.h>

// Prints the three characters, then the iswdigit and "jdigit" results for each.
void test(wchar_t a3, wchar_t u3, wchar_t j3)
{
    printf("        '%lc' '%lc' '%lc'\n", a3, u3, j3);
    printf("iswdigit %d    %d   %d\n",
           !!iswdigit(a3), !!iswdigit(u3), !!iswdigit(j3));
    printf("jdigit:  %d    %d   %d\n",
           !!iswctype(a3, wctype("jdigit")),
           !!iswctype(u3, wctype("jdigit")),
           !!iswctype(j3, wctype("jdigit")));
}

int main(void)
{
    wchar_t a3 = L'3';  // the ASCII digit 3
    wchar_t u3 = L'三'; // the CJK numeral 3
    wchar_t j3 = L'３'; // the fullwidth digit 3

    setlocale(LC_ALL, "en_US.utf8");
    puts("In American locale:");
    test(a3, u3, j3);

    setlocale(LC_ALL, "ja_JP.utf8");
    puts("\nIn Japanese locale:");
    test(a3, u3, j3);
}

Output:

In American locale:
        '3' '三' '３'
iswdigit 1    0   0
jdigit:  0    0   0

In Japanese locale:
        '3' '三' '３'
iswdigit 1    0   0
jdigit:  0    0   1

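References

C11 standard (ISO/IEC 9899:2011):

7.30.2.1.5 The iswdigit function (p: 449)

C99 standard (ISO/IEC 9899:1999):

7.25.2.1.5 The iswdigit function (p: 395)

See also

| isdigit | checks if a character is a digit (function) |
|:----|:----|

| C++ documentation for iswdigit |
|:----|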

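| ASCII value (decimal) | ASCII value (hexadecimal) | Characters | iscntrl iswcntrl | isprint iswprint | isspace iswspace | isblank iswblank | isgraph iswgraph | ispunct iswpunct | isalnum iswalnum | isalpha iswalpha | isupper iswupper | islower iswlower | isdigit iswdigit | isxdigit iswxdigit |
|:----|:----|:----|:----|:----|:----|:----|:----|:----|:----|:----|:----|:----|:----|:----|
| 0 - 8 | 0x00-0x08 | control codes (NUL, etc.) | ≠0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 9 | 0x09 | tab (\t) | ≠0 | 0 | ≠0 | ≠0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 10 - 13 | 0x0A-0x0D | whitespace (\n, \v, \f, \r) | ≠0 | 0 | ≠0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 14 - 31 | 0x0E-0x1F | control codes | ≠0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 32 | 0x20 | space | 0 | ≠0 | ≠0 | ≠0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 33 - 47 | 0x21-0x2F | !"#$%&'()*+,-./ | 0 | ≠0 | 0 | 0 | ≠0 | ≠0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 48 - 57 | 0x30-0x39 | 0123456789 | 0 | ≠0 | 0 | 0 | ≠0 | 0 | ≠0 | 0 | 0 | 0 | ≠0 | ≠0 |
| 58 - 64 | 0x3A-0x40 | :;<=>?@ | 0 | ≠0 | 0 | 0 | ≠0 | ≠0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 65 - 70 | 0x41-0x46 | ABCDEF | 0 | ≠0 | 0 | 0 | ≠0 | 0 | ≠0 | ≠0 | ≠0 | 0 | 0 | ≠0 |
| 71 - 90 | 0x47-0x5A | GHIJKLMNOPQRSTUVWXYZ | 0 | ≠0 | 0 | 0 | ≠0 | 0 | ≠0 | ≠0 | ≠0 | 0 | 0 | 0 |
| 91 - 96 | 0x5B-0x60 | [\]^_` | 0 | ≠0 | 0 | 0 | ≠0 | ≠0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 97 - 102 | 0x61-0x66 | abcdef | 0 | ≠0 | 0 | 0 | ≠0 | 0 | ≠0 | ≠0 | 0 | ≠0 | 0 | ≠0 |
| 103 - 122 | 0x67-0x7A | ghijklmnopqrstuvwxyz | 0 | ≠0 | 0 | 0 | ≠0 | 0 | ≠0 | ≠0 | 0 | ≠0 | 0 | 0 |
| 123 - 126 | 0x7B-0x7E | {\|}~ | 0 | ≠0 | 0 | 0 | ≠0 | ≠0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 127 | 0x7F | delete character (DEL) | ≠0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |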