liba 0.1.15
An algorithm library based on C/C++
Loading...
Searching...
No Matches
utf.h File Reference

Unicode Transformation Format. More...

#include "a.h"
Include dependency graph for utf.h:

Go to the source code of this file.

Macros

#define a_utf_len(str)
 

Functions

unsigned int a_utf_encode (void *str, uint32_t val)
 encode one unicode code point into UTF-8
 
unsigned int a_utf_decode (void const *str, uint32_t *val)
 decode one unicode code point from UTF-8
 
size_t a_utf_length (void const *str)
 length of a UTF-8 string terminated with a null character
 

Detailed Description

Unicode Transformation Format.

Unicode UTF-8
U+0000000 ~ U+0000007F 0XXXXXXX
U+0000080 ~ U+000007FF 110XXXXX 10XXXXXX
U+0000800 ~ U+0000FFFF 1110XXXX 10XXXXXX 10XXXXXX
U+0010000 ~ U+001FFFFF 11110XXX 10XXXXXX 10XXXXXX 10XXXXXX
U+0200000 ~ U+03FFFFFF 111110XX 10XXXXXX 10XXXXXX 10XXXXXX 10XXXXXX
U+4000000 ~ U+7FFFFFFF 1111110X 10XXXXXX 10XXXXXX 10XXXXXX 10XXXXXX 10XXXXXX