>>>>> On Mon, 06 Jun 2016, Mart Raudsepp wrote:

> Usually only two letter language codes suffice, but can be limited with
> country codes with a 'll_CC' formatting, where 'll' is the language code
> and 'CC' is the country code, e.g en_GB. Some rare languages also have
> three letter language codes.

s/country code/territory code/g

A related question: do we take the opportunity to standardise the
values? It looks like the vast majority follow the
language[_territory][@modifier] format specified by POSIX [1], but some
don't.

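To find the values that don't follow that format, something like the
following could be used. This is only a rough sketch: the regular
expression is my own reading of language[_territory][@modifier] (two or
three lowercase letters, an optional two-letter uppercase territory, an
optional alphanumeric modifier), and the sample values "pt-BR" and "EN"
are made up for illustration, not taken from the actual list.

```python
import re

# Assumed shape of language[_territory][@modifier]; this is an
# approximation for illustration, not the full POSIX grammar.
LOCALE_RE = re.compile(r"[a-z]{2,3}(_[A-Z]{2})?(@[A-Za-z0-9]+)?")


def is_wellformed(value: str) -> bool:
    """Return True if the whole value matches the assumed format."""
    return LOCALE_RE.fullmatch(value) is not None


# Hypothetical sample values; the last two are invented counterexamples.
candidates = ["en_GB", "sr@Latn", "de", "pt-BR", "EN"]
nonconforming = [v for v in candidates if not is_wellformed(v)]
print(nonconforming)
```

Running this over the real list of values would give the set that needs
to be looked at by hand.
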
There are also a few duplicates, like sr@Latn / sr@latin and uz@Cyrl /
uz@cyrillic. I suggest that we adhere to the BCP 47 [2] names if
possible (which would be Latn and Cyrl for the examples mentioned).

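Folding the duplicates together could look roughly like this. It is a
sketch only: the alias table is hypothetical and contains just the two
pairs mentioned above, and the function name is my own choice.

```python
# Hypothetical alias table, limited to the two duplicates named above.
MODIFIER_ALIASES = {"latin": "Latn", "cyrillic": "Cyrl"}


def canonicalise(value: str) -> str:
    """Rewrite a known modifier alias to its four-letter script name."""
    base, sep, modifier = value.partition("@")
    if not sep:
        # No @modifier part, nothing to rewrite.
        return value
    return f"{base}@{MODIFIER_ALIASES.get(modifier, modifier)}"


for value in ["sr@latin", "uz@cyrillic", "sr@Latn", "en_GB"]:
    print(value, "->", canonicalise(value))
```

Unknown modifiers are passed through unchanged, so anything not in the
table would still need a manual decision.
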
Ulrich

[1] http://pubs.opengroup.org/onlinepubs/9699919799/basedefs/V1_chap08.html#tag_08_02
[2] http://www.rfc-editor.org/rfc/bcp/bcp47.txt