Fw: PDF library for reading PDF files
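Question
Hi!
I am looking for a library in Python that would read PDF files, so that I could extract information from the PDF with it. I have searched with Google, but only found libraries that can be used to write PDF files.
Any ideas?
Peter
Recommended answers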
> I am looking for a library in Python that would read PDF files and I
> could extract information from the PDF with it. I have searched with
> google, but only found libraries that can be used to write PDF files.
ReportLab has a lib called PageCatcher; it is fully supported with Python,
but it is not free.
Harald
"Peter Galfi" <ga****@freestart.hu> wrote in message news:<ma**************************************@python.org>...
> I am looking for a library in Python that would read PDF files and I
> could extract information from the PDF with it. I have searched with
> google, but only found libraries that can be used to write PDF files.
> Any ideas?
I quickly searched back through Google, but I knew exactly what I was
looking for: ;-)
http://groups.google.com/groups?selm...ing.google.com
The page referred to is here:
http://www.boddie.org.uk/david/Proje...thon/pdftools/
The module is very much a "work in progress". You can probably get
some text and bitmap images out of a few documents, but that's
probably all you can expect unless you want to improve it (and
submit patches).
Good luck!
David
In article <Xn**********************************@62.153.159.134>,
Harald Massa <cp*********@spamgourmet.com> wrote:
>> I am looking for a library in Python that would read PDF files and I
>> could extract information from the PDF with it. I have searched with
>> google, but only found libraries that can be used to write PDF files.
> reportlab has a lib called pagecatcher; it is fully supported with python,
> it is not free.
> Harald
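ReportLab's libraries are great, but they do not extract information
from PDF, at least not in the sense I believe the original
questioner intended. As Andreas suggested, he would probably do best to use an existing standalone application as a separate process, controlled from Python.
--
Cameron Laird <cl****@phaseit.net>
Business: http://www.Phaseit.net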
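The "separate process controlled from Python" approach suggested above could look roughly like the sketch below. The thread does not name a specific application, so this assumes the `pdftotext` command-line tool (shipped with Xpdf and Poppler) is installed and on the PATH. The function names `build_command` and `extract_text` are made up for illustration, not part of any library.

```python
import shutil
import subprocess


def build_command(pdf_path, tool="pdftotext", keep_layout=False):
    """Build the argument list for the external text extractor.

    Assumption: `tool` behaves like pdftotext, where a trailing "-"
    means "write the extracted text to standard output".
    """
    cmd = [tool]
    if keep_layout:
        # pdftotext's -layout flag tries to preserve the page layout.
        cmd.append("-layout")
    cmd += [str(pdf_path), "-"]
    return cmd


def extract_text(pdf_path, tool="pdftotext", keep_layout=False):
    """Run the external tool as a separate process and return its output."""
    if shutil.which(tool) is None:
        raise FileNotFoundError(f"{tool!r} was not found on PATH")
    result = subprocess.run(
        build_command(pdf_path, tool, keep_layout),
        capture_output=True,  # collect stdout and stderr instead of printing
        check=True,           # raise CalledProcessError on a non-zero exit
    )
    # Decode leniently, since the output encoding can vary by document.
    return result.stdout.decode("utf-8", errors="replace")
```

Calling `extract_text("report.pdf")` would then return the document's text as a single string, where `report.pdf` is only a placeholder file name. Keeping the command construction in its own function makes it easy to swap in a different extractor later.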