That's not true! There are strings which gets longer when a composing
normalization is applied. Please see section 3.3 of Unicode Techical
Report 36:
http://www.unicode.org/reports/tr36/
> People assume that NFC always composes, and thus is the same or
> shorter length than the original source. However, some characters
> decompose in NFC.
(NFC = Normalization Form Composing.)
U+1D160 MUSICAL SYMBOL EIGHT NOTE is given as an example with a 3x
expansion factor when encoded in UTF-8 (I don't know what it expands to;
seems odd to me.)
-hpa
-
To unsubscribe from this list: send the line "unsubscribe git" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html