Checks whether an Unicode code point is a bidirectional control character.
Return !=0 if unicode alpha. Use table from C99 Appendix D.
Returns the code length of c in code units for the encoding. sz is the encoding: 1 = utf8, 2 = utf16, 4 = utf32.
Returns the code length of c in code units.
Decode a UTF-8 sequence as a single UTF-32 code point.
Decode a UTF-16 sequence as a single UTF-32 code point.
The Unicode code space is the range of code points [0x000000,0x10FFFF] except the UTF-16 surrogate pairs in the range [0xD800,0xDFFF]
Functions related to UTF encoding.