Re: [2.6 patch] UTF-8 fixes in comments

!MAILaRCHIVE_VOTE_RePLACE
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]
To: Willy Tarreau <w@...>
Cc: Helge Hafting <helge.hafting@...>, H. Peter Anvin <hpa@...>, <linux-kernel@...>, <trivial@...>
Date: Tuesday, April 29, 2008 - 6:42 am

On Tue, Apr 29, 2008 at 12:09:34PM +0200, Willy Tarreau wrote:

I can reproduce your problem in a plain xterm when setting LANG=en_US
(most likely the same problem can occur with other non UTF-8 settings).

In this case I'm actually more surprised that the character is displayed 
correctly than that you have to type backspace twice.

Any kind of charset mixing is highly problematic (which is also why my 
patch was attached compressed), so if you disable UTF-8 anywhere in a 
modern distribution problems are somehow expected (it could also be a 
bug in Mandrivas default settings, but that would really surprise me).


It's not a compressed encoding, it's a variable-length encoding.

Besides the size advantages one main advantage of UTF-8 is that ASCII is 
valid UTF-8. This means that for the ASCII source code in the kernel it 
doesn't matter whether it's treated as ASCII or UTF-8, and no conversion 
was needed.

You can't get this property with a fixed-size Unicode encoding.


cu
Adrian

-- 

       "Is there not promise of rain?" Ling Tan asked suddenly out
        of the darkness. There had been need of rain for many days.
       "Only a promise," Lao Er said.
                                       Pearl S. Buck - Dragon Seed

--
Previous message: [thread] [date] [author]
Next message: [thread] [date] [author]

Messages in current thread:
[2.6 patch] UTF-8 fixes in comments, Adrian Bunk, (Mon Apr 28, 11:40 am)
Re: [2.6 patch] UTF-8 fixes in comments, KOSAKI Motohiro, (Tue Apr 29, 8:18 am)
Re: [2.6 patch] UTF-8 fixes in comments, Willy Tarreau, (Mon Apr 28, 7:05 pm)
Re: [2.6 patch] UTF-8 fixes in comments, Alan Cox, (Tue Apr 29, 5:01 am)
Re: [2.6 patch] UTF-8 fixes in comments, Willy Tarreau, (Tue Apr 29, 5:34 am)
Re: [2.6 patch] UTF-8 fixes in comments, Alan Cox, (Tue Apr 29, 5:41 am)
Re: [2.6 patch] UTF-8 fixes in comments, Jan Engelhardt, (Tue Apr 29, 5:19 am)
Re: [2.6 patch] UTF-8 fixes in comments, H. Peter Anvin, (Mon Apr 28, 9:29 pm)
Re: [2.6 patch] UTF-8 fixes in comments, Willy Tarreau, (Tue Apr 29, 1:06 am)
Re: [2.6 patch] UTF-8 fixes in comments, David Kågedal, (Fri May 9, 8:48 am)
Re: [2.6 patch] UTF-8 fixes in comments, Adrian Bunk, (Tue Apr 29, 3:29 am)
Re: [2.6 patch] UTF-8 fixes in comments, Willy Tarreau, (Tue Apr 29, 4:14 am)
Re: [2.6 patch] UTF-8 fixes in comments, H. Peter Anvin, (Tue Apr 29, 3:31 pm)
Re: [2.6 patch] UTF-8 fixes in comments, Willy Tarreau, (Tue Apr 29, 4:05 pm)
Re: [2.6 patch] UTF-8 fixes in comments, H. Peter Anvin, (Tue Apr 29, 4:09 pm)
Re: [2.6 patch] UTF-8 fixes in comments, Adrian Bunk, (Tue Apr 29, 5:43 am)
Re: [2.6 patch] UTF-8 fixes in comments, Helge Hafting, (Tue Apr 29, 5:06 am)
Re: [2.6 patch] UTF-8 fixes in comments, Willy Tarreau, (Tue Apr 29, 6:09 am)
Re: [2.6 patch] UTF-8 fixes in comments, Helge Hafting, (Wed Apr 30, 5:15 am)
Re: [2.6 patch] UTF-8 fixes in comments, H. Peter Anvin, (Wed Apr 30, 3:42 pm)
Re: [2.6 patch] UTF-8 fixes in comments, Adrian Bunk, (Wed Apr 30, 3:22 pm)
Re: [2.6 patch] UTF-8 fixes in comments, Adrian Bunk, (Tue Apr 29, 6:42 am)
Re: [2.6 patch] UTF-8 fixes in comments, Willy Tarreau, (Tue Apr 29, 7:06 am)
Re: [2.6 patch] UTF-8 fixes in comments, Adrian Bunk, (Tue Apr 29, 7:27 am)
Re: [2.6 patch] UTF-8 fixes in comments, Adrian Bunk, (Tue Apr 29, 7:32 am)
Re: [2.6 patch] UTF-8 fixes in comments, Jeremy Fitzhardinge, (Tue Apr 29, 4:18 pm)
Re: [2.6 patch] UTF-8 fixes in comments, Alan Cox, (Tue Apr 29, 6:10 am)
Re: [2.6 patch] UTF-8 fixes in comments, H. Peter Anvin, (Tue Apr 29, 3:33 pm)
Re: [2.6 patch] UTF-8 fixes in comments, Willy Tarreau, (Tue Apr 29, 6:33 am)
Re: [2.6 patch] UTF-8 fixes in comments, Alexander E. Patrakov, (Thu May 1, 5:46 am)
Re: [2.6 patch] UTF-8 fixes in comments, Alan Cox, (Tue Apr 29, 6:34 am)
Re: [2.6 patch] UTF-8 fixes in comments, Willy Tarreau, (Tue Apr 29, 6:12 pm)
Re: [2.6 patch] UTF-8 fixes in comments, Alan Cox, (Tue Apr 29, 6:15 pm)
Re: [2.6 patch] UTF-8 fixes in comments, Willy Tarreau, (Tue Apr 29, 7:05 pm)
Re: [2.6 patch] UTF-8 fixes in comments, H. Peter Anvin, (Thu May 1, 4:18 pm)
Re: [2.6 patch] UTF-8 fixes in comments, Alan Cox, (Tue Apr 29, 5:33 am)
Re: [2.6 patch] UTF-8 fixes in comments, H. Peter Anvin, (Tue Apr 29, 2:04 am)