Fw:用于阅读PDF文件的PDF库 [英] Fw: PDF library for reading PDF files

查看:506
本文介绍了Fw:用于阅读PDF文件的PDF库的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

嗨!


我正在寻找一个可以阅读PDF文件的Python库,我可以用它从PDF中提取信息。我用谷歌搜索过,但只找到了可以用来写PDF文件的库。


有什么想法吗?


彼得

推荐答案

>我正在寻找一个可以读取PDF文件的Python库,而我
> I am looking for a library in Python that would read PDF files and I
可以用它从PDF中提取信息。我用谷歌搜索过,但只找到了可用于编写PDF文件的库。
could extract information from the PDF with it. I have searched with
google, but only found libraries that can be used to write PDF files.



reportlab有一个名为pagecatcher的库;它完全支持python,

它不是免费的。


Harald



reportlab has a lib called pagecatcher; it is fully supported with python,
it is not free.

Harald


" Peter Galfi < GA **** @ freestart.hu>在消息新闻中写道:< ma ************************************** @ pyt hon。 org> ...
"Peter Galfi" <ga****@freestart.hu> wrote in message news:<ma**************************************@pyt hon.org>...
我正在寻找一个可以阅读PDF文件的Python库,我可以用它从PDF中提取信息。我用谷歌搜索了,但只找到了可用于编写PDF文件的库。

任何想法?
I am looking for a library in Python that would read PDF files and I
could extract information from the PDF with it. I have searched with
google, but only found libraries that can be used to write PDF files.

Any ideas?




我很快就通过谷歌搜索了一下,但我确切地知道我在找什么?
寻找:;-)

http://groups.google.com/groups?selm...ing.google.com


提到的页面在这里:

http://www.boddie.org.uk/david/Proje...thon/pdftools/


该模块非常正在进行中。您可以从一些文档中获得一些文本和位图图像,但是除非您想要改进它,否则这可能是您所期望的全部(&b) br />
提交补丁)。


祝你好运!


David



I quickly searched back through Google, but I knew exactly what I was
looking for: ;-)

http://groups.google.com/groups?selm...ing.google.com

The page referred to is here:

http://www.boddie.org.uk/david/Proje...thon/pdftools/

The module is very much a "work in progress". You can probably get
some text and bitmap images out of a few documents, but that''s
probably all you can expect unless you want to improve it (and
submit patches).

Good luck!

David

在文章< Xn ********************************** @ 62.153.159.1 34>中,

Harald Massa< cp ********* @ spamgourmet.com>写道:
In article <Xn**********************************@62.153.159.1 34>,
Harald Massa <cp*********@spamgourmet.com> wrote:
我正在寻找一个可以读取PDF文件的Python库,我可以用它从PDF中提取信息。我用谷歌搜索过,但只找到了可用于编写PDF文件的库。
I am looking for a library in Python that would read PDF files and I
could extract information from the PDF with it. I have searched with
google, but only found libraries that can be used to write PDF files.



reportlab有一个名为pagecatcher的库;它完全支持python,
它不是免费的。

Harald



reportlab has a lib called pagecatcher; it is fully supported with python,
it is not free.

Harald




ReportLab的库很棒 - - 但是他们没有从PDF中提取

信息。从某种意义上说,我相信原来的

提问者的意图。正如安德烈亚斯建议的那样,他可能最好使用现有的独立应用程序作为单独的进程,使用Python控制


- -


Cameron Laird< cl **** @ phaseit.net>

业务: http://www.Phaseit.net


这篇关于Fw:用于阅读PDF文件的PDF库的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆