以编程方式阅读,突出显示,保存PDF [英] read, highlight, save PDF programmatically

查看:67
本文介绍了以编程方式阅读,突出显示,保存PDF的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我想编写一个小的脚本(将在无头Linux服务器上运行),该脚本读取PDF,突出显示与我传递的字符串数组中的任何内容匹配的文本,然后保存修改后的PDF.我想我最终会使用类似与poppler的Python绑定之类的东西,但是不幸的是零文档和我在python方面几乎没有零经验.

I'd like to write a small script (which will run on a headless Linux server) that reads a PDF, highlights text that matches anything in an array of strings that I pass, then saves the modified PDF. I imagine I'll end up using something like the python bindings to poppler but unfortunately there's next to zero documentation and I have next to zero experience in python.

如果任何人都可以指出我的教程,示例或一些有用的文档来帮助我入门,那么将不胜感激!

If anyone could point me to a tutorial, example, or some helpful documentation to get me started it would be greatly appreciated!

推荐答案

您是否尝试过查看 PDFMiner ?听起来像是您想要的.

Have you tried looking at PDFMiner? It sounds like it does what you want.

这篇关于以编程方式阅读,突出显示,保存PDF的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆