The following enumerated values are the possible types of decomposition of a Unicode character.
UCD marker:
<font>. Denotes a font variant (e.g. a blackletter form).
UCD marker:
<noBreak>. Denotes a no-break version of a space or hyphen.
UCD marker:
<initial>. Denotes an initial presentation form (Arabic).
UCD marker:
<isolated>. Denotes an isolated presentation form (Arabic).
UCD marker:
<vertical>. Denotes a vertical layout presentation form.
UCD marker:
<wide>. Denotes a wide (or zenkaku) compatibility character.
UCD marker:
<narrow>. Denotes a narrow (or hankaku) compatibility character.
UCD marker:
<small>. Denotes a small variant form (CNS compatibility).
UCD marker:
<compat>. Denotes an otherwise unspecified compatibility character.
The following constant denotes the maximum size of decomposition of a single Unicode character.
This macro expands to a constant that is the required size of buffer passed to the
uc_decompositionanduc_canonical_decompositionfunctions.
The following functions decompose a Unicode character.
Returns the character decomposition mapping of the Unicode character uc. decomposition must point to an array of at least
UC_DECOMPOSITION_MAX_LENGTHucs_telements.When a decomposition exists, decomposition
[0..n-1]and*decomp_tag are filled and n is returned. Otherwise -1 is returned.
Returns the canonical character decomposition mapping of the Unicode character uc. decomposition must point to an array of at least
UC_DECOMPOSITION_MAX_LENGTHucs_telements.When a decomposition exists, decomposition
[0..n-1]is filled and n is returned. Otherwise -1 is returned.