Gentoo Archives: gentoo-user

From: Florian Philipp <lists@×××××××××××.net>
To: gentoo-user@l.g.o
Subject: Re: [gentoo-user] PDF puzzle : files available
Date: Sat, 14 Jan 2012 11:23:29
Message-Id: 4F11654B.6070503@binarywings.net
In Reply to: Re: [gentoo-user] PDF puzzle : files available by Philip Webb
1 Am 14.01.2012 04:21, schrieb Philip Webb:
2 > 120113 Florian Philipp wrote:
3 >> Am 13.01.2012 11:32, schrieb Philip Webb:
4 >>> I've been in the habit of creating PDFs of .odt files using LibreOffice.
5 >>> I was reminded of 'ps2pdf', which resulted in :
6 >>> -rw-r--r-- 1 purslow users 34475 Jan 12 07:52 boox.odt
7 >>> -rw-r--r-- 1 purslow users 366711 Jan 13 04:05 boox.pdf
8 >>> -rw-r--r-- 1 purslow purslow 830909 Jan 13 04:07 boox.ps
9 >>> -rw-r--r-- 1 purslow purslow 67184 Jan 13 04:07 boox-test.pdf
10 >>> Both PDFs seem the same when opened with Xpdf or Okular,
11 >>> but the PDF created with LO direct export is 5 times
12 >>> the size of the PDF created with a test run of Ps2pdf ;
13 >>> the .ps file created with LO & then used for the latter is even bigger.
14 >>> Can anyone explain what is going on ?
15 >> Try the pdfdebugger provided by dev-java/pdfbox to inspect both files.
16 >
17 > That needs Java, which I am definitely not going to re-install (smile).
18 >
19
20 Well, I'll resist the temptation to start a flamewar over this ;)
21
22 >> Maybe one of them uses a stupid internal structure.
23 >
24 > OpenOffice was notorious for legacy bloat, so that would make sense.
25 > LO devs have managed to remove a large amount of useless code.
26 >
27 >> I couldn't reproduce this with app-office/libreoffice-3.4.3.2-r1
28 >> & cups-pdf , which should use the same backend as ps2pdf .
29 >> the files were nearly equal in size, c 60 kB for about 1 page of text.
30 >
31 > My file has 14 pp of simple text. I've now uploaded the files above
32 > & everyone can inspect them & their structure at
33 >
34 > http://www.chass.utoronto.ca/~purslow/test/
35 >
36 > I've renamed the PDFs to show their origin, ie LibreOffice + Ghostscript.
37 > Perhaps that will allow others to investigate.
38 >
39 > I will also have a look at Cups-pdf & at Willie Wong's suggestions.
40 >
41 > PS the file is a list of books I've noted as worth reading,
42 > so it may have a wider (OT) interest than simply to solve this problem.
43 >
44
45 I can reproduce the behavior with my LO.
46
47 I've inspected the files with pdfdebugger. The LO-version really
48 contains more, but nothing which seems to justify the difference. In
49 fact, the content streams of each page seem to be better compressed in
50 th LO version.
51
52 Cups-PDF creates a smaller PDF than ps2pdf, probably because it outputs
53 PDF-1.5. Otherwise it is identical to the other ghostscript outputs.
54
55 Out of curiosity, I removed all pictures from an old report I wrote (25
56 pages) and tested that. There, Cups-PDF creates larger files than LO
57 although the internal structure is similar to what you've provided.
58
59 So I guess, all we can say is that their performance is inconsistent.
60
61 Regards,
62 Florian Philipp

Attachments

File name MIME type
signature.asc application/pgp-signature

Replies

Subject Author
Re: [gentoo-user] PDF puzzle Philip Webb <purslow@××××××××.net>