Gentoo Archives: gentoo-dev

From: Dan Douglas <ormaaj@×××××.com>
To: gentoo-dev@l.g.o
Subject: Re: [gentoo-dev] UTF-8 locale by default
Date: Fri, 27 Jul 2012 08:08:06
Message-Id: 2734677.97hjmIHWxX@smorgbox
In Reply to: Re: [gentoo-dev] UTF-8 locale by default by Ulrich Mueller
1 On Friday, July 27, 2012 09:08:36 AM Ulrich Mueller wrote:
2 > >>>>> On Fri, 27 Jul 2012, Ben de Groot wrote:
3 >
4 > > I understand why the council rejected Debian's C.UTF-8 option,
5 > > but is there really no better default that we can use?
6 >
7 > > Without any default locale set, in practically all cases that means
8 > > that the user is presented with English, and mostly the American
9 > > variant. So, in practice, we are defaulting to en_US, just not in a
10 > > unicode environment. Correct me if I'm wrong.
11 >
12 > See below. We're not defaulting to en_US for things like the number
13 > format.
14 >
15 > > Also, in most other places (such as our website, GLEPs, ebuilds)
16 > > we default to en_US.UTF-8.
17 >
18 > > So let's upgrade to en_US.UTF-8, which is for most users more
19 > > desirable than the current situation. Of course we will still advise
20 > > them to set their desired locales in /etc/locale.gen. But at least
21 > > they will start with a unicode environment, as expected anno 2012.
22 >
23 > As I had pointed out before [1], changing from POSIX to an en_US
24 > locale will have undesirable side effects, like commas as thousands
25 > separators in numbers (because of LC_NUMERIC). Also the defaults of
26 > en_US for LC_MEASUREMENT and LC_PAPER are only useful in the U.S.
27 >
28 > So if we change the default (but I still don't see the need), we
29 > should go for a less intrusive setting like:
30 >
31 > LANG="POSIX"
32 > LC_CTYPE="en_US.utf8"
33 >
34 > Ulrich
35 >
36
37 You're concerned about the commas breaking things? Given that you usually need
38 to specifically ask for them (i.e., printf ' flag), and that kind of output is
39 usually going to be for human consumption only that seems unlikely. If
40 anything does rely upon the format, can't tolerate different locales, and fails
41 to specify LC_NUMERIC then it's broken anyway.
42
43 LC_MONETARY / LC_MEASUREMENT as en_US are probably slightly more annoying
44 defaults for some people. What do users of other distros think? Is this really
45 a serious problem for anyone?
46
47 LC_CTYPE=en_US.utf8 would be a bare minimum. The important bit is getting utf8
48 by default. I can live with LANG=POSIX.
49 --
50 Dan Douglas

Attachments

File name MIME type
signature.asc application/pgp-signature

Replies

Subject Author
Re: [gentoo-dev] UTF-8 locale by default Ben de Groot <yngwin@g.o>