Gentoo Archives: gentoo-dev

From: "Robin H. Johnson" <robbat2@g.o>
To: gentoo-dev@l.g.o
Subject: Re: [gentoo-dev] UTF-8 in GLEP56 use flags?
Date: Wed, 01 Oct 2008 23:52:03
Message-Id: 20081001233713.GM7634@curie-int.orbis-terrarum.net
In Reply to: Re: [gentoo-dev] UTF-8 in GLEP56 use flags? by Ben de Groot
On Thu, Oct 02, 2008 at 01:30:34AM +0200, Ben de Groot wrote:
> The xml header in each metadata.xml states that the content is UTF-8
> encoded, and any XML parser has to be able to handle this. Also, when
> used literally in xml, the 5 special characters & ' " < > cause a
> well-formedness error, as far as I know. U+2019 is the recommended form
> of using the apostrophe. So in my opinion, if we want to use xml, we
> should use unicode properly.
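"xmllint --valid" is my check (plain xmllint already checks
well-formedness; --valid additionally validates against the DTD), and it
says the single quotes are fine.

According to http://www.w3.org/TR/xml11/#dt-chardata the chars "&" and
"<" have "MUST" as their requirement for being escaped in character
data. ">" "MAY" be escaped, and "MUST" be only when it appears in the
string "]]>" that is not closing a CDATA section. Double and single
quotes "MAY" be escaped.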
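
The same distinction can be checked without xmllint. What follows is
only a minimal sketch in Python (chosen for illustration, nothing in
this thread uses it), with the standard library's expat-based parser
standing in for a well-formedness check. The helper name is_well_formed
and the sample <flag> strings are hypothetical, not taken from any real
metadata.xml.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape


def is_well_formed(doc: str) -> bool:
    """Return True if doc parses as XML (no DTD validation is done)."""
    try:
        ET.fromstring(doc)
    except ET.ParseError:
        return False
    return True


# Literal quotes and ">" in character data, plus U+2019: all accepted.
ok_ascii = "<flag name='x'>Don't use \"this\" when a > b</flag>"
ok_unicode = "<flag name='x'>Don\u2019t use this</flag>"

# A bare "&" or "<" in character data is a well-formedness error.
bad_amp = "<flag name='x'>fish & chips</flag>"
bad_lt = "<flag name='x'>a < b</flag>"

for sample in (ok_ascii, ok_unicode, bad_amp, bad_lt):
    print(is_well_formed(sample), repr(sample))

# escape() rewrites "&", "<" and ">" and leaves quotes untouched,
# which is enough for text that goes into element content.
escaped = escape("fish & chips < 5, don't \"quote\"")
print(escaped)
print(is_well_formed("<flag name='x'>" + escaped + "</flag>"))
```

Text destined for an attribute value additionally needs whichever quote
character delimits the attribute to be escaped; for element content, as
in the use flag descriptions, that is not required.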

-- 
Robin Hugh Johnson
Gentoo Linux Developer & Infra Guy
E-Mail : robbat2@g.o
GnuPG FP : 11AC BA4F 4778 E3F6 E4ED F38E B27B 944E 3488 4E85