Re: [PATCH] Use FIX_UTF8_MAC to enable conversion from UTF8-MAC to UTF8

From: H. Peter Anvin
Date: Monday, January 21, 2008 - 9:08 pm

Mark Junker wrote:

That's not true!  There are strings which get longer when a composing 
normalization is applied.  Please see section 3.3 of Unicode Technical 
Report 36:

	http://www.unicode.org/reports/tr36/

 > People assume that NFC always composes, and thus is the same or
 > shorter length than the original source. However, some characters
 > decompose in NFC.

(NFC = Normalization Form C, i.e. canonical composition.)

U+1D160 MUSICAL SYMBOL EIGHTH NOTE is given as an example with a 3x 
expansion factor when encoded in UTF-8 (I don't know what it expands to; 
it seems odd to me.)
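
The expansion can be checked with a short script.  What follows is only a 
minimal sketch using Python's standard unicodedata module; the helper name 
nfc_expansion is made up for illustration, and the exact result depends on 
the Unicode data shipped with the interpreter.

```python
import unicodedata


def nfc_expansion(text: str) -> tuple[int, int]:
    """Return the UTF-8 byte length of text before and after NFC."""
    normalized = unicodedata.normalize("NFC", text)
    return len(text.encode("utf-8")), len(normalized.encode("utf-8"))


if __name__ == "__main__":
    original = "\U0001D160"  # MUSICAL SYMBOL EIGHTH NOTE
    normalized = unicodedata.normalize("NFC", original)
    before, after = nfc_expansion(original)
    # List the code points NFC produced, then the byte counts.
    print("NFC code points:", " ".join(f"U+{ord(c):04X}" for c in normalized))
    print(f"UTF-8 bytes: {before} -> {after}")
```

With current Unicode data this should report 4 bytes before and 12 after, 
because the character's canonical decomposition consists of code points that 
are excluded from recomposition, so NFC leaves it as three 4-byte characters.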

	-hpa

Messages in current thread:
Re: [PATCH] Use FIX_UTF8_MAC to enable conversion from UTF ..., Johannes Schindelin, (Mon Jan 21, 4:24 am)
Re: [PATCH] Use FIX_UTF8_MAC to enable conversion from UTF ..., Johannes Schindelin, (Mon Jan 21, 5:09 am)
Re: [PATCH] Use FIX_UTF8_MAC to enable conversion from UTF ..., Johannes Schindelin, (Mon Jan 21, 12:14 pm)
Re: [PATCH] Use FIX_UTF8_MAC to enable conversion from UTF ..., H. Peter Anvin, (Mon Jan 21, 9:08 pm)