| Unicode Normalization |
Article Index for Unicode |
Shopping Unicode |
Website Links For Unicode |
Information AboutUnicode Normalization |
| CATEGORIES ABOUT UNICODE NORMALIZATION | |
| unicode | |
| unicode algorithms | |
| normalization | |
|
COMPOSITION AND DECOMPOSITION Underlying Unicode's normalization methods is the concept of character composition and decomposition. Character composition is the process of combining simpler characters into fewer Precomposed Character s, such as the n character and the combining ~ character into the single ñ character. Decomposition is the opposite process, breaking precomposed characters back into their component pieces. Unicode composes combining characters and decomposes compatibility characters based on what it calls and Unicode Compatibility Characters for more information. STANDARDS Unicode defines four normalization standards. All the above methods will standardize the order in which decomposed characters appear, even sequences that were already decomposed prior to normalization. They may also replace characters or sequences with equivalent characters or sequences even if it doesn't result in the number of characters changing. These are done to achieve the consistency in encoding required for normalization. SEE ALSO
EXTERNAL LINKS
|
|
|