Robin H. Johnson wrote:
> On Sun, Apr 30, 2006 at 06:30:23PM +0200, Patrick Lauer wrote:
>> We have ~15k .tar.gz in distfiles. ~6500 .tar.bz2, ~2000 others.
>> A short run over 477 distfiles spanning 833M gave me 586M of .tar.bz2 -
>> roughly 30% more efficient!
>> A comparison run with 7zip gave me 590M files, so bzip2 seems to be
>> quite good.
> Try rzip, esp. on the larger files, and see a serious improvement, with
> the cost of one major penalty [*].
>
> * rzip cannot handle streams, it seeks across the file multiple times
> for what it does.
>

/me tried this for fun too; the values are consistent across runs.

foreach of:
export CMD='rzip -9 gcc-4.2-20060429.tar'
export CMD='rzip -d gcc-4.2-20060429.tar.rz'
export CMD='bzip2 -d gcc-4.2-20060429.tar.bz2'
export CMD='bzip2 -9 gcc-4.2-20060429.tar'
export CMD='gzip gcc-4.2-20060429.tar'
export CMD='gunzip gcc-4.2-20060429.tar.gz'

$CMD &>/dev/null & \
J=$(jobs -l 1 | cut -c 6- ) ; J=${J% Running*} \
; while [[ -d /proc/${J} ]] ; do sleep 0.05 ; echo -n "$J " ; grep VmPeak /proc/${J}/status ; done

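The loop above can also be written without parsing `jobs -l` output. What follows is only a sketch, not what was run for the numbers in this mail: it takes the PID from `$!` and prints just the last VmPeak line it managed to read. The name `peak_of` is made up for illustration, and it assumes Linux with /proc mounted and a `sleep` that accepts fractional seconds.

```shell
# Sketch only: peak_of is a hypothetical helper; Linux with /proc assumed.
peak_of() (
    # Run the given command in the background, discarding its output.
    "$@" >/dev/null 2>&1 &
    pid=$!
    peak=""
    while [ -d "/proc/$pid" ]; do
        # The status file can vanish (or lose its Vm* lines once the
        # process has exited), so keep the previous value on failure.
        line=$(grep VmPeak "/proc/$pid/status" 2>/dev/null) && peak=$line
        sleep 0.05
    done
    echo "${peak:-VmPeak: not sampled}"
)
```

Usage would be something like `peak_of bzip2 -9 gcc-4.2-20060429.tar`. Very short-lived commands may finish before the first sample, in which case nothing useful is reported.
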
file: gcc-4.2-20060429.tar size 268160 kB

Compression:
rzip -9
    VmPeak 345368 kB
    Size 34437.87 kB
bzip2 -9
    VmPeak 9224 kB
    Size 38024.95 kB
gzip
    VmPeak 1940 kB
    Size 50368.28 kB

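For comparison, here are the compressed sizes as a percentage of the 268160 kB tarball. This is plain arithmetic on the numbers posted above; `ratio` is just an illustrative helper name.

```shell
# Sizes in kB are copied from the figures above; ratio is a hypothetical helper.
orig=268160
ratio() {
    # Print compressed size as a percentage of the original, one decimal.
    LC_ALL=C awk -v o="$orig" -v c="$1" 'BEGIN { printf "%.1f%%\n", 100 * c / o }'
}
ratio 34437.87   # rzip -9  -> 12.8%
ratio 38024.95   # bzip2 -9 -> 14.2%
ratio 50368.28   # gzip     -> 18.8%
```

So on this file rzip's output is about 9% smaller than bzip2's, at roughly 37 times the peak virtual memory (345368 kB vs 9224 kB).
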
De-compression:
bzip2 -d
    VmPeak 5448 kB
rzip -d
    VmPeak 7892 kB
gunzip
    VmPeak 1940 kB

--
gentoo-dev@g.o mailing list