Gentoo Archives: gentoo-dev

From: "Michał Górny" <mgorny@g.o>
To: gentoo-dev@l.g.o
Cc: "Michał Górny" <mgorny@g.o>
Subject: [gentoo-dev] [PATCH v2 2/3] glep-0074: Specify supported hash algorithms
Date: Mon, 12 Sep 2022 13:39:44
Message-Id: 20220912133850.1202010-3-mgorny@gentoo.org
In Reply to: [gentoo-dev] [PATCH v2 0/3] glep-0074: Explicitly specify hashes and compressed Manifest formats by "Michał Górny"
1 Replace the informational hash name section with a formal specification
2 of allowed hash algorithms. The original reasoning for leaving them
3 implementation-defined was poor. After all, not a single new hash
4 was added since the initial version of the GLEP. At the same time,
5 ensuring consistent support for at least a minimal set of hash
6 algorithms is crucial to interoperability. Given that the effort needed
7 to update the GLEP is relatively small, it is better to require all
8 algorithms to be formally listed than to have to track all
9 implementations for new hashes and hope for consistency.
10
11 Signed-off-by: Michał Górny <mgorny@g.o>
12 ---
13 glep-0074.rst | 136 ++++++++++++++++++++++++++++++++------------------
14 1 file changed, 88 insertions(+), 48 deletions(-)
15
16 diff --git a/glep-0074.rst b/glep-0074.rst
17 index 8cec618..9a0e92b 100644
18 --- a/glep-0074.rst
19 +++ b/glep-0074.rst
20 @@ -6,7 +6,7 @@ Author: Michał Górny <mgorny@g.o>,
21 Ulrich Müller <ulm@g.o>
22 Type: Standards Track
23 Status: Final
24 -Version: 1.2
25 +Version: 1.3
26 Created: 2017-10-21
27 Last-Modified: 2022-09-11
28 Post-History: 2017-10-26, 2017-11-16, 2018-02-08, 2022-09-08, 2022-09-11
29 @@ -26,6 +26,9 @@ efficient and provide means of backwards compatibility.
30 Changes
31 =======
32
33 +v1.3
34 + Formally specified the current set of hash algorithms supported.
35 +
36 v1.2
37 Specified the newline convention used for Manifests.
38
39 @@ -364,27 +367,60 @@ up to and including the *original* directory. Note that those
40 sub-Manifests can use different filenames than ``Manifest``.
41
42
43 -Checksum algorithms (informational)
44 ------------------------------------
45 -
46 -This section is informational only. Specifying the exact set
47 -of supported algorithms is outside the scope of this specification.
48 -
49 -The algorithm names reserved at the time of writing are:
50 -
51 -- ``MD5`` [#MD5]_,
52 -- ``RMD160`` -- RIPEMD-160 [#RIPEMD160]_,
53 -- ``SHA1`` [#SHS]_,
54 -- ``SHA256`` and ``SHA512`` -- SHA-2 family of hashes [#SHS]_,
55 -- ``WHIRLPOOL`` [#WHIRLPOOL]_,
56 -- ``BLAKE2B`` and ``BLAKE2S`` -- BLAKE2 family of hashes [#BLAKE2]_,
57 -- ``SHA3_256`` and ``SHA3_512`` -- SHA-3 family of hashes [#SHA3]_,
58 -- ``STREEBOG256`` and ``STREEBOG512`` -- Streebog family of hashes
59 - [#STREEBOG]_.
60 -
61 -The method of introducing new hashes is defined by GLEP 59 [#GLEP59]_.
62 -It is recommended that any new hashes are named after the Python
63 -``hashlib`` module algorithm names, transformed into uppercase.
64 +Checksum algorithms
65 +-------------------
66 +
67 +.. table:: Table 1. Defined hash algorithms
68 + :widths: auto
69 +
70 + +-----------------+-----------------------+------+------+-------------+
71 + | Name | Specification | Bits | Enc. | Notes |
72 + +=================+=======================+======+======+=============+
73 + | ``BLAKE2B`` | | 512 | Hex | Recommended |
74 + +-----------------+ RFC 7693 [#RFC7693]_ +------+------+-------------+
75 + | ``BLAKE2S`` | | 256 | Hex | |
76 + +-----------------+-----------------------+------+------+-------------+
77 + | ``MD5`` | RFC 1321 [#RFC1321]_ | 128 | Hex | Deprecated |
78 + +-----------------+-----------------------+------+------+-------------+
79 + | ``RMD160`` | RIPEMD-160 [#RMD160]_ | 160 | Hex | Deprecated |
80 + +-----------------+-----------------------+------+------+-------------+
81 + | ``SHA1`` | | 160 | Hex | Deprecated |
82 + +-----------------+ +------+------+-------------+
83 + | ``SHA256`` | FIPS 180-4 [#SHS]_ | 256 | Hex | |
84 + +-----------------+ +------+------+-------------+
85 + | ``SHA512`` | | 512 | Hex | Recommended |
86 + +-----------------+-----------------------+------+------+-------------+
87 + | ``SHA3_256`` | | 256 | Hex | |
88 + +-----------------+ FIPS 202 [#SHA3]_ +------+------+-------------+
89 + | ``SHA3_512`` | | 512 | Hex | |
90 + +-----------------+-----------------------+------+------+-------------+
91 + | ``STREEBOG256`` | | 256 | Hex | |
92 + +-----------------+ RFC 6986 [#RFC6986]_ +------+------+-------------+
93 + | ``STREEBOG512`` | | 512 | Hex | |
94 + +-----------------+-----------------------+------+------+-------------+
95 + | ``WHIRLPOOL`` | Whirlpool [#BARRETO]_ | 512 | Hex | |
96 + +-----------------+-----------------------+------+------+-------------+
97 +
98 +Any new hashes must be added to this specification prior to being used
99 +in Manifest files. Adding a new hash is considered
100 +a backwards-compatible change to the GLEP. It is recommended that new
101 +hashes are named after the Python ``hashlib`` module algorithm names,
102 +transformed into uppercase, with dashes replaced by underscores.
103 +
104 +An implementation can implement an arbitrary subset of the listed
105 +hashes. For best interoperability, it should implement at least
106 +recommended hashes. If deprecated hashes are implemented, it is
107 +preferable to disallow their use by default.
108 +
109 +If an entry specifies multiple hashes using different algorithms,
110 +an implementation may choose to verify an arbitrary subset of them.
111 +However, should any tested hash yield a mismatch, the verification must
112 +fail.
113 +
114 +If a particular hash is either unsupported or unknown,
115 +the implementation can either ignore it or report a failure. However,
116 +at least one algorithm specified for a particular entry must be
117 +supported for the verification to succeed.
118
119
120 Manifest compression
121 @@ -913,23 +949,25 @@ tool working with this Manifest format.
122 Hash algorithms
123 ---------------
124
125 -While maintaining a consistent supported hash set is important
126 -for interoperability, it is not a good fit for the generic layout
127 -of this GLEP. Furthermore, it would require updating the GLEP
128 -in the future every time the used algorithms change.
129 +Originally, this GLEP did not formally specify the complete set of hash
130 +algorithms. Instead, it only listed (informationally) the names already
131 +used at the time of writing. Since enforcing consistent use of algorithm
132 +names is important for interoperability, this was changed in version
133 +1.3.
134
135 -Instead, the specification focuses on listing the currently used
136 -algorithm names for interoperability, and sets a recommendation
137 -for consistent naming of algorithms in the future. The Python
138 -``hashlib`` module is used as a reference since it is used
139 -as the provider of hash functions for most of the Python software,
140 -including Portage and PkgCore.
141 +Since the effort needed to update the GLEP is small compared to the time
142 +needed for a new hash algorithm to be well-deployed, the GLEP needs
143 +to be updated prior to adding a new hash method.
144
145 -The basic rules for changing hash algorithms are defined in GLEP 59
146 -[#GLEP59]_. The implementations can focus only on those algorithms
147 -that are actually used or planned on being used. It may be feasible
148 -to devise a new GLEP that specifies the currently used hashes (or update
149 -GLEP 59 accordingly).
150 +The recommended naming is based off Python ``hashlib`` module,
151 +as most of the Gentoo tooling is written in Python. The names
152 +are transformed to match the historical naming used for hash functions
153 +in Manifests.
154 +
155 +Implementations are allowed to support and use only a subset of hashes
156 +listed in Manifest files. This could be used both to avoid the overhead
157 +of computing multiple hashes on non-performant systems, and to handle
158 +transition to new hashes gracefully.
159
160
161 Manifest compression
162 @@ -1072,27 +1110,29 @@ References
163 .. [#FILE-NAMING-RULES] Ebuild File Format -- Gentoo Development Guide
164 (https://devmanual.gentoo.org/ebuild-writing/file-format/#file-naming-rules)
165
166 -.. [#MD5] RFC1321: The MD5 Message-Digest Algorithm
167 - (https://www.ietf.org/rfc/rfc1321.txt)
168 +.. [#RFC7693] RFC 7693: The BLAKE2 Cryptographic Hash and Message Authentication
169 + Code (MAC)
170 + (https://www.rfc-editor.org/rfc/rfc7693)
171 +
172 +.. [#RFC1321] RFC 1321: The MD5 Message-Digest Algorithm
173 + (https://www.rfc-editor.org/rfc/rfc1321)
174
175 -.. [#RIPEMD160] The hash function RIPEMD-160
176 +.. [#RMD160] The hash function RIPEMD-160
177 (https://homes.esat.kuleuven.be/~bosselae/ripemd160.html)
178
179 .. [#SHS] FIPS PUB 180-4: Secure Hash Standard (SHS)
180 (http://nvlpubs.nist.gov/nistpubs/FIPS/NIST.FIPS.180-4.pdf)
181
182 -.. [#WHIRLPOOL] The WHIRLPOOL Hash Function
183 - (http://www.larc.usp.br/~pbarreto/WhirlpoolPage.html)
184 -
185 -.. [#BLAKE2] BLAKE2 -- fast secure hashing
186 - (https://blake2.net/)
187 -
188 .. [#SHA3] FIPS PUB 202: SHA-3 Standard: Permutation-Based Hash
189 and Extendable-Output Functions
190 (http://nvlpubs.nist.gov/nistpubs/FIPS/NIST.FIPS.202.pdf)
191
192 -.. [#STREEBOG] GOST R 34.11-2012: Streebog Hash Function
193 - (https://www.streebog.net/)
194 +.. [#RFC6986] RFC 6986: GOST R 34.11-2012: Hash Function
195 + (https://www.rfc-editor.org/rfc/rfc6986)
196 +
197 +.. [#BARRETO] Paulo S. L. M. Barreto, The WHIRLPOOL Hash Function
198 + (archived at 2017-11-29)
199 + (https://web.archive.org/web/20171129084214/http://www.larc.usp.br/~pbarreto/WhirlpoolPage.html)
200
201 .. [#C08] Cappos, J et al. (2008). "Attacks on Package Managers"
202 (https://www2.cs.arizona.edu/stork/packagemanagersecurity/attacks-on-package-managers.html)
203 --
204 2.37.3

Replies