Gentoo Archives: gentoo-dev

From: Francesco Riosa <vivo@g.o>
To: gentoo-dev@l.g.o
Subject: Re: [gentoo-dev] Having fun with compression
Date: Tue, 02 May 2006 15:30:39
Message-Id: 44577BCC.5080604@gentoo.org
In Reply to: Re: [gentoo-dev] Having fun with compression by "Robin H. Johnson"
1 Robin H. Johnson wrote:
2 > On Sun, Apr 30, 2006 at 06:30:23PM +0200, Patrick Lauer wrote:
3 >> We have ~15k .tar.gz in distfiles. ~6500 .tar.bz2, ~2000 others.
4 >> A short run over 477 distfiles spanning 833M gave me 586M of .tar.bz2 -
5 >> roughly 30% more efficient!
6 >> A comparison run with 7zip gave me 590M files, so bzip2 seems to be
7 >> quite good.
8 > Try rzip, esp. on the larger files, and see a serious improvement, with
9 > the cost of one major penalty [*].
10 >
11 > * rzip cannot handle streams, it seeks across the file multiple times
12 > for what it does.
13 >
14
15 /me for fun too, values are consistnt between various run
16
17 foreach of:
18 export CMD='rzip -9 gcc-4.2-20060429.tar'
19 export CMD='rzip -d gcc-4.2-20060429.tar.rz'
20 export CMD='bzip2 -d gcc-4.2-20060429.tar.bz2'
21 export CMD='bzip2 -9 gcc-4.2-20060429.tar'
22 export CMD='gzip gcc-4.2-20060429.tar'
23 export CMD='gunzip gcc-4.2-20060429.tar.gz'
24
25 $CMD &>/dev/null & \
26 J=$(jobs -l 1 | cut -c 6- ) ; J=${J% Running*} \
27 ; while [[ -d /proc/${J} ]] ; do sleep 0.05 ; echo -n "$J " ; grep
28 VmPeak /proc/${J}/status ; done
29
30
31 file: gcc-4.2-20060429.tar size 268160 kB
32
33 Compression:
34 rzip -9
35 VmPeak 345368 kB
36 Size 34437.87 kB
37 bzip2 -9
38 VmPeak 9224 kB
39 Size 38024.95 kB
40 gzip
41 VmPeak 1940 kB
42 Size 50368.28 kB
43
44
45 De-compression:
46 bzip2 -d
47 VmPeak 5448 kB
48 rzip -d
49 VmPeak 7892 kB
50 gunzip
51 VmPeak 1940 kB
52
53 --
54 gentoo-dev@g.o mailing list