c16rtomb

From cppreference.com
< c‎ | string‎ | multibyte
Defined in header <uchar.h>
size_t c16rtomb( char* s, char16_t c16, mbstate_t* ps );
(since C11)

Converts a 16-bit wide character to narrow multibyte character.

If s is not a null pointer, the function determines the number of bytes necessary to store the multibyte character representation of c16 (including any shift sequences), and stores the multibyte character representation in the character array whose first element is pointed to by s. At most MB_CUR_MAX bytes can be written by this function.

If s is a null pointer, the call is equivalent to c16rtomb(buf, u'\0', ps) for some internal buffer buf.

If c16 is the null wide character u'\0', a null byte is stored, preceded by any shift sequence necessary to restore the initial shift state and the conversion state parameter *ps is updated to represent the initial shift state.

If the macro __STDC_UTF_16__ is defined, the 16-bit encoding used by this function is UTF-16, otherwise it is implementation-defined. In any case, the multibyte encoding used by this function is specified by the currently active C locale.

Contents

[edit] Parameters

s - pointer to narrow character array where the multibyte character will be stored
c16 - the 16-bit character to convert
ps - pointer to the conversion state object used when interpreting the multibyte string

[edit] Return value

On success, returns the number of bytes (including any shift sequences) written to the character array whose first element is pointed to by s. This value may be 0, e.g. when processing the first char16_t in a surrogate pair.

On failure (if c16 is not a valid 16-bit character), returns -1, stores EILSEQ in errno, and leaves *ps in unspecified state.

[edit] Notes

Unlike mbrtoc16, which converts variable-width multibyte (such as UTF-8) to variable-width 16-bit (such as UTF-16) encoding, this function can only convert single-unit 16-bit encoding, meaning it cannot convert UTF-16 to UTF-8.

[edit] Example

#include <stdio.h>
#include <locale.h>
#include <uchar.h>
#include <stdlib.h>
 
mbstate_t state;
int main(void)
{
    setlocale(LC_ALL, "en_US.utf8");
    char16_t str[] = u"zß水"; // or z\u00df\u6c34
    size_t str_sz = sizeof str / sizeof *str;
 
    printf("Processing %zu UCS-2 code units: [ ", str_sz);
    for(size_t n = 0; n < str_sz; ++n) printf("%#x ", str[n]); puts("]");
 
    char out[MB_CUR_MAX];
    for(size_t n = 0; n < str_sz; ++n)
    {   
        int rc = c16rtomb(out, str[n], &state);
        printf("%#x converted to [ ", str[n]);
        for(int x = 0; x < rc; ++x) printf("%#x ", +(unsigned char)out[x]); puts("]");
    }
}

Output:

Processing 4 UCS-2 code units: [ 0x7a 0xdf 0x6c34 0 ]
0x7a converted to [ 0x7a ]
0xdf converted to [ 0xc3 0x9f ]
0x6c34 converted to [ 0xe6 0xb0 0xb4 ]
0 converted to [ 0 ]

[edit] References

  • C11 standard (ISO/IEC 9899:2011):
  • 7.28.1.2 The c16rtomb function (p: 399-400)

[edit] See also

(C11)
generate the next 16-bit wide character from a narrow multibyte string
(function)
C++ documentation for c16rtomb