可靠的方法(编程)比较PDF文件? [英] Reliable way to (programmatically) compare PDFs?

查看:146
本文介绍了可靠的方法(编程)比较PDF文件?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

可能重复:结果
工具比较大量的PDF文件?

我在经典场景,业务给你一堆的新的的PDF在新的一年,没有修改的形式任何笔记,你应该弄清楚什么是从以前的不同一年的。

I am in the classic scenario where the business gives you a bunch of new pdf forms for the new year with no revision notes whatsoever and you are supposed to figure out what's different from the previous year ones.

我在这里讲的形式加载,所以我试图找到一种方法来比较PDF文件勾勒出差异,而无需人来手动完成对每其中之一。

I am talking loads of forms here, so I am trying to find a way to compare PDFs to outline differences without having people to manually go through each and every one of them.

我的想法是提取所有从PDF文件的文本,并将其倾倒入.txt然后运行文本文件的差异,但它听起来太可怕了。

My idea was to extract all the text from the PDFs and dump it into a .txt then run differences on text files, but it sounds horrible.

我的问题说编程方式,但我很高兴与比较PDF文件,主要希望得到经验的人的想法任何可靠的工具。也愿意接受任何程序化解决方案(最好在C#中,但请拍出来的任何的想法)。

My question says programmatically, but I'd be happy with any reliable tools for comparing PDFs, and mainly looking to get an idea from people experiences. Also willing to entertain any programmatic solutions (preferably in C# but pls shoot out any ideas).

推荐答案

有相当声称差异PDF文件的几个软件产品。我从来没有需要使用一次,但如果这将是一个反复的过程,我认为它会是明智的,为贵公司在其中的一个投资。只是谷歌的PDF差异为一堆潜在应用。

There is quite a few software products that claim to diff pdfs. I've never had need to use one but if this is going to be a recurring process I think it'd be wise for your company to invest in one of them. Just Google "pdf diff" for a bunch of potential applications.

此外,你的情况很相似,这个问题:的 http://stackoverflow.com/questions/145657/how-to-compare-two-pdf-files 我想讨论可能的帮助。

Additionally, your situation is very similar to this question: http://stackoverflow.com/questions/145657/how-to-compare-two-pdf-files I think its discussion may help.

这篇关于可靠的方法(编程)比较PDF文件?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆