Known Anomalies in Unicode Character Names
- #Technical Note
- #Character Names
- #Unicode
- This document lists known anomalies in Unicode character names, including misspellings, misleading names, and other issues.
- Under the Name Stability Policy, a Unicode character name cannot be changed once it has been assigned, so that other standards can reliably reference characters by name.
- Normative character name aliases are provided for characters with serious errors, allowing alternative identifiers without changing the original name.
- Informative aliases are also provided as alternative names that aid communication, but they are not guaranteed to be unique or stable.
- The document serves as a summary of character name anomalies and will be updated as new issues are identified.
- Examples of anomalies include U+0149 (named as a single letter, although it represents an apostrophe followed by n), U+01A2 and U+01A3 (named 'OI' but should be 'GHA'), and U+02C7 (named 'CARON' but should be 'hacek').
- Some character names, such as those of U+039B and U+03BB, use the spelling 'LAMDA' instead of 'lambda' due to ISO 10646 conventions.
- The names of the Hebrew cantillation marks U+0598 (HEBREW ACCENT ZARQA) and U+05AE (HEBREW ACCENT ZINOR) are inconsistent with how these marks are named in the prose and poetic cantillation systems.
- The document acknowledges contributions from various experts and lists modifications from previous versions.
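
The distinction between immutable formal names and normative aliases can be seen with Python's standard `unicodedata` module. This is a minimal sketch, not part of the technical note itself. It assumes Python 3.3 or later, where `unicodedata.lookup` accepts name aliases, and the helper name `formal_names` is hypothetical, introduced only for illustration.

```python
import unicodedata

# Code points mentioned in the summary above.
CODE_POINTS = [0x0149, 0x01A2, 0x01A3, 0x02C7, 0x039B, 0x03BB, 0x0598, 0x05AE]


def formal_names(code_points):
    """Map each code point to its formal (immutable) character name.

    Hypothetical helper for illustration only.
    """
    return {cp: unicodedata.name(chr(cp)) for cp in code_points}


names = formal_names(CODE_POINTS)
for cp, name in names.items():
    print(f"U+{cp:04X}  {name}")

# The formal name of U+01A2 keeps the anomalous 'OI' ...
oi_name = unicodedata.name("\u01a2")

# ... while the normative alias resolves to the same character.
gha_char = unicodedata.lookup("LATIN CAPITAL LETTER GHA")
print(oi_name, "->", f"U+{ord(gha_char):04X}")
```

Note that `unicodedata.name` always returns the original formal name, never the alias, which matches the stability guarantee described above. The exact set of aliases recognized depends on the Unicode version bundled with the Python build in use.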