Re: [2.6 patch] UTF-8 fixes in comments

To: Willy Tarreau <w@...>
Cc: H. Peter Anvin <hpa@...>, <linux-kernel@...>, <trivial@...>
Date: Tuesday, April 29, 2008 - 3:29 am

On Tue, Apr 29, 2008 at 07:06:05AM +0200, Willy Tarreau wrote:

Non-ancient distributions default to UTF-8 and have tools that handle it 
fine.

If you had bad experiences in the last millennium you should try again.


Accents are very rare in names in the kernel.

Most non-ASCII characters are umlauts and there's no sane way to 
express them in ASCII (and the vowels without umlaut are pronounced 
quite differently and might even make names look very strange).

And that's only within European languages; beyond them it becomes even 
worse.


The comments in the kernel were converted to UTF-8 quite some time 
ago; what I'm fixing with my patch is just some recent non-UTF-8 stuff 
that crept in.
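
A minimal sketch of how such stray non-UTF-8 lines could be located, 
assuming Python. This is not the method used for the patch; the function 
name and the sample name "Jörg" are hypothetical and only illustrate the 
idea of strictly decoding each line as UTF-8 and reporting the failures.

```python
def find_non_utf8_lines(data: bytes):
    """Return (line_number, raw_line) pairs for lines that are not valid UTF-8."""
    bad = []
    # Splitting on b"\n" is safe: byte 0x0A never occurs inside a
    # multi-byte UTF-8 sequence.
    for number, line in enumerate(data.split(b"\n"), start=1):
        try:
            line.decode("utf-8")  # strict decoding rejects stray high bytes
        except UnicodeDecodeError:
            bad.append((number, line))
    return bad


# Hypothetical sample: line 1 stores "ö" as UTF-8 (0xC3 0xB6),
# line 2 stores it as a single iso-8859-1 byte (0xF6).
sample = b"/* Copyright J\xc3\xb6rg */\n/* Copyright J\xf6rg */\n"
print(find_non_utf8_lines(sample))
```

Only the second sample line is reported, because a lone 0xF6 byte is not 
a valid UTF-8 sequence.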

And names in comments in the kernel have not been pure ASCII since very 
early on; they were in other charsets.

Mostly iso-8859-1, but not all of them.

I remember that for one name we first guessed which character it was and 
then tried to figure out which charset it was in (no, it was not one 
of iso-8859-*).

So it was not "ASCII -> UTF-8", it was
"several different charsets -> UTF-8".
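
To illustrate why this was guesswork, here is a rough sketch, assuming 
Python, of converting bytes of unknown charset to UTF-8 by trying 
candidate charsets in order. The function name, the candidate list, and 
the sample bytes are hypothetical, not what was actually done.

```python
def to_utf8(raw: bytes, candidates=("utf-8", "iso-8859-1")):
    """Decode raw with the first candidate charset that accepts it.

    Returns (utf8_bytes, charset_used). The candidate list is an
    assumption; a successful decode does not prove the guess is right.
    """
    for charset in candidates:
        try:
            text = raw.decode(charset)
        except UnicodeDecodeError:
            continue
        return text.encode("utf-8"), charset
    raise ValueError("no candidate charset decoded the input")


print(to_utf8(b"J\xc3\xb6rg"))  # already valid UTF-8, passes through unchanged
print(to_utf8(b"J\xf6rg"))      # not valid UTF-8, falls back to iso-8859-1

# The same byte means different characters in different charsets,
# so a human still has to check that the resulting name looks right.
print(b"\xb3".decode("iso-8859-1"), b"\xb3".decode("iso-8859-2"))
```

Python's iso-8859-1 codec accepts every byte value, so it acts as a 
catch-all here and cannot detect a wrong guess; that is exactly the case 
where the character has to be guessed first and the charset second.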


cu
Adrian

-- 

       "Is there not promise of rain?" Ling Tan asked suddenly out
        of the darkness. There had been need of rain for many days.
       "Only a promise," Lao Er said.
                                       Pearl S. Buck - Dragon Seed


Messages in current thread:
[2.6 patch] UTF-8 fixes in comments, Adrian Bunk, (Mon Apr 28, 11:40 am)
Re: [2.6 patch] UTF-8 fixes in comments, KOSAKI Motohiro, (Tue Apr 29, 8:18 am)
Re: [2.6 patch] UTF-8 fixes in comments, Willy Tarreau, (Mon Apr 28, 7:05 pm)
Re: [2.6 patch] UTF-8 fixes in comments, Alan Cox, (Tue Apr 29, 5:01 am)
Re: [2.6 patch] UTF-8 fixes in comments, Willy Tarreau, (Tue Apr 29, 5:34 am)
Re: [2.6 patch] UTF-8 fixes in comments, Alan Cox, (Tue Apr 29, 5:41 am)
Re: [2.6 patch] UTF-8 fixes in comments, Jan Engelhardt, (Tue Apr 29, 5:19 am)
Re: [2.6 patch] UTF-8 fixes in comments, H. Peter Anvin, (Mon Apr 28, 9:29 pm)
Re: [2.6 patch] UTF-8 fixes in comments, Willy Tarreau, (Tue Apr 29, 1:06 am)
Re: [2.6 patch] UTF-8 fixes in comments, David Kågedal, (Fri May 9, 8:48 am)
Re: [2.6 patch] UTF-8 fixes in comments, Adrian Bunk, (Tue Apr 29, 3:29 am)
Re: [2.6 patch] UTF-8 fixes in comments, Willy Tarreau, (Tue Apr 29, 4:14 am)
Re: [2.6 patch] UTF-8 fixes in comments, H. Peter Anvin, (Tue Apr 29, 3:31 pm)
Re: [2.6 patch] UTF-8 fixes in comments, Willy Tarreau, (Tue Apr 29, 4:05 pm)
Re: [2.6 patch] UTF-8 fixes in comments, H. Peter Anvin, (Tue Apr 29, 4:09 pm)
Re: [2.6 patch] UTF-8 fixes in comments, Adrian Bunk, (Tue Apr 29, 5:43 am)
Re: [2.6 patch] UTF-8 fixes in comments, Helge Hafting, (Tue Apr 29, 5:06 am)
Re: [2.6 patch] UTF-8 fixes in comments, Willy Tarreau, (Tue Apr 29, 6:09 am)
Re: [2.6 patch] UTF-8 fixes in comments, Helge Hafting, (Wed Apr 30, 5:15 am)
Re: [2.6 patch] UTF-8 fixes in comments, H. Peter Anvin, (Wed Apr 30, 3:42 pm)
Re: [2.6 patch] UTF-8 fixes in comments, Adrian Bunk, (Wed Apr 30, 3:22 pm)
Re: [2.6 patch] UTF-8 fixes in comments, Adrian Bunk, (Tue Apr 29, 6:42 am)
Re: [2.6 patch] UTF-8 fixes in comments, Willy Tarreau, (Tue Apr 29, 7:06 am)
Re: [2.6 patch] UTF-8 fixes in comments, Adrian Bunk, (Tue Apr 29, 7:27 am)
Re: [2.6 patch] UTF-8 fixes in comments, Adrian Bunk, (Tue Apr 29, 7:32 am)
Re: [2.6 patch] UTF-8 fixes in comments, Jeremy Fitzhardinge, (Tue Apr 29, 4:18 pm)
Re: [2.6 patch] UTF-8 fixes in comments, Alan Cox, (Tue Apr 29, 6:10 am)
Re: [2.6 patch] UTF-8 fixes in comments, H. Peter Anvin, (Tue Apr 29, 3:33 pm)
Re: [2.6 patch] UTF-8 fixes in comments, Willy Tarreau, (Tue Apr 29, 6:33 am)
Re: [2.6 patch] UTF-8 fixes in comments, Alexander E. Patrakov, (Thu May 1, 5:46 am)
Re: [2.6 patch] UTF-8 fixes in comments, Alan Cox, (Tue Apr 29, 6:34 am)
Re: [2.6 patch] UTF-8 fixes in comments, Willy Tarreau, (Tue Apr 29, 6:12 pm)
Re: [2.6 patch] UTF-8 fixes in comments, Alan Cox, (Tue Apr 29, 6:15 pm)
Re: [2.6 patch] UTF-8 fixes in comments, Willy Tarreau, (Tue Apr 29, 7:05 pm)
Re: [2.6 patch] UTF-8 fixes in comments, H. Peter Anvin, (Thu May 1, 4:18 pm)
Re: [2.6 patch] UTF-8 fixes in comments, Alan Cox, (Tue Apr 29, 5:33 am)
Re: [2.6 patch] UTF-8 fixes in comments, H. Peter Anvin, (Tue Apr 29, 2:04 am)