Imagemagick:生成PDF flate嵌入的原始图像数据? [英] Imagemagick: generate raw image data for PDF flate embedding?
问题描述
我试图想出一个命令行,PDF的源代码示例(另请参见如何生成在文档查看器中工作的纯文本源代码PDF示例? ),但嵌入图像。这是我有多远 - 我有一个模板 hello.pdf
:
I'm trying to come up with a command-line, source code example of a PDF (see also How to generate plain-text source-code PDF examples that work in a document viewer?), but with embedding an image. This is how far I got - I have a template hello.pdf
:
%PDF-1.4
1 0 obj
<< /Type /Catalog
/Outlines 2 0 R
/Pages 3 0 R
>>
endobj
2 0 obj
<< /Type /Outlines
/Count 0
>>
endobj
3 0 obj
<< /Type /Pages
/Kids [ 4 0 R ]
/Count 1
>>
endobj
4 0 obj
<< /Type /Page
/Parent 3 0 R
/MediaBox [ 0 0 612 792 ]
/Contents 5 0 R
/Resources << /ProcSet 6 0 R
/Font << /F1 7 0 R >>
/ProcSet [ /PDF /Text /ImageC ] /XObject << /Im1 8 0 R >>
>>
>>
endobj
5 0 obj
<< /Length 173 >>
stream
BT
/F1 24 Tf
100 100 Td
( Hello World ) Tj
200 200 Td
( Hello Again ) Tj
% width skew-right-up skew-top-right height x y
150 0 0 150 340 130 cm
% 150 150 Td - irrelevant for image
/Im1 Do
ET
endstream
endobj
6 0 obj
[ /PDF /Text ]
endobj
7 0 obj
<< /Type /Font
/Subtype /Type1
/Name /F1
/BaseFont /Helvetica
/Encoding /MacRomanEncoding
>>
endobj
8 0 obj
<<
/BitsPerComponent 8 /ColorSpace /DeviceRGB /DecodeParms
<<
/BitsPerComponent 8 /Colors 3 /Columns 150 /Predictor 10
>>
/Filter [ /FlateDecode ] /Height 150 /Length 1418 /Subtype /Image /Type /XObject /Width 150
>>
stream
###endstream
endobj
9 0 obj
[ /PDF ]
endobj
xref
0 10
0000000000 65535 f
0000000009 00000 n
0000000074 00000 n
0000000120 00000 n
0000000179 00000 n
0000000364 00000 n
0000000466 00000 n
0000000496 00000 n
0000001000 00000 n
0000001100 00000 n
trailer
<< /Size 10
/Root 1 0 R
>>
startxref
625
%%EOF
图像数据使用 convert
,并使用 zlib
应用flate压缩:
Then, I generate the image data using convert
, and I apply "flate" compression using zlib
:
convert -size 150x150 gradient:\#4b4-\#bfb test.ppm
du -b test.ppm # 135017 bytes
python -c "import zlib,sys;sys.stdout.write(zlib.compress(sys.stdin.read()))" < test.ppm > test.flate
du -b test.flate # 1418 bytes
/ Length 1418
,最后使用:
perl -ne 's/^###/`cat test.flate`/e;print' hello.pdf > hello2.pdf
这个文件显然有不正确的xref表,但是在 evince
很好:
This file obviously has incorrect xref table, however, opens in evince
just fine:
...但是,很明显,位图不是正确的格式。
... however, it's obvious that the bitmap is not in correct format.
我也尝试过生成
convert -size 150x150 gradient:\#4b4-\#bfb -endian LSB rgb:test.raw
...但是那些甚至不是一般的绿色(原始图像应该是)。
... but those are not even generally green (as the original image should be).
是否有人知道正确的图片格式 - / code>命令行 - 生成可以flated并包含在pdf中的原始图像?
Does anyone know the correct image format - and the convert
command line - to generate a raw image that can be "flated" and included in a pdf?
非常感谢您提供任何答案,
干杯!
Many thanks in advance for any answers,
Cheers!
推荐答案
好的,修复它;问题是,必须在 convert
命令行中指定8位深度;因此正确的调用是:
Ok, fixed it; the problem was that one had to specify 8-bit depth in the convert
command line; thus the correct invocation is:
convert -depth 8 -size 150x150 gradient:\#4b4-\#bfb rgb:test.raw
然后我们有:
du -b test.raw # 67500 bytes
python -c "import zlib,sys;sys.stdout.write(zlib.compress(sys.stdin.read()))" < test.raw > test.flate
du -b test.flate # 664 bytes
# replace /Length 664, and then:
perl -ne 's/^###/`cat test.flate`/e;print' hello.pdf > hello2.pdf
最后, hello2.pdf
在 evince
中打开并正确显示位图:
Finally, the hello2.pdf
opens in evince
and displays the bitmap correctly:
Btw,我发现这是因为我实际上试图调试另一个文档中的图像;所以我基本上做了以下:
Btw, I found this because I'm actually trying to debug an image in another document; so I basically did the following:
# extract and save the stream of this image object
qpdf --show-object=23 --raw-stream-data mybadfile.pdf > myraw.file
# get raw binary data - deflate the saved object stream
python -c "import zlib,sys;sys.stdout.write(zlib.decompress(sys.stdin.read()))" < myraw.file > myraw.deflate
identify myraw.deflate
# identify: no decode delegate for this image format `myraw.deflate' @ constitute.c/ReadImage/530.
identify rgb:myraw.deflate
# identify: Must specify image size `myraw.deflate' @ rgb.c/ReadRGBImage/155.
identify -size 588x508 rgb:myraw.deflate
# rgb:myraw.deflate=>myraw.deflate RGB 588x508 588x508+0+0 16-bit TrueColor DirectClass 875KiB 0.020u 0:00.030
# identify: Unexpected end-of-file `myraw.deflate': No such file or directory @ rgb.c/ReadRGBImage/261.
display -size 588x508 rgb:myraw.deflate
# display: Unexpected end-of-file `myraw.deflate': No such file or directory @ rgb.c/ReadRGBImage/261. ### but it shows correctly, except for size?
identify -depth 8 -size 588x508 rgb:myraw.deflate
# rgb:myraw.deflate=>myraw.deflate RGB 588x508 588x508+0+0 8-bit TrueColor DirectClass 875KiB 0.020u 0:00 ## OK
display -depth 8 -size 588x508 rgb:myraw.deflate
# OK; choosing rgba: is already bad - so confirmed 8-bit rgb
希望这有助于某人,
干杯!
Hope this helps someone,
Cheers!
这篇关于Imagemagick:生成PDF flate嵌入的原始图像数据?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!