查找损坏的.pdf文件 [英] To find the corrupted .pdf files

查看:180
本文介绍了查找损坏的.pdf文件的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

大家好,



我有大约116222 .pdf文件。其中我需要找出损坏的文件。任何人都可以告诉我是否有任何软件(免费或付费)来获取那些被破坏的文件,反之亦然。我google了很多但找不到任何东西。所有结果都显示了修复软件。



任何建议对我都非常有帮助。

Hi All,

I have near about 116222 .pdf files. Among them I need to find out the corrupted files. Can any one please tell me is there any software (free or paid) to get those files which are corrupted or vice versa. I googled a lot but could not find any. All the result showing the fixing software.

Any suggestion will be very much helpful for me.

推荐答案

问题在于判断文件是否已损坏。



如果您没有每个文件的SHA哈希值,或类似的东西,那么唯一可以判断文件是否已损坏的方法是尝试将其作为PDF文件读取 - 如果不能,则表明它已损坏,或者使用读者软件的PDF规范的更高版本。



如果你能阅读它们,那么它们可能并没有腐败 - 你需要一个人来阅读它们并确保它们看起来像我应该怀疑的那样 - 所以你可以忽略它们。



我会通过阅读器处理它们,然后为它们设置一个SHA哈希值,以便下次可以立即检测到任何更改。
The problem is in deciding if the file is "corrupted".

If you don't have a SHA hash value for each file, or something similar, then the only way you can tell if the file is corrupted is to try to read it as a PDF file - if you can't then it is either corrupt, or uses a later version of the PDF specification that your reader software.

If you can read them, then they probably aren't corrupt - you would need a human to reader them and ensure they look as they should I suspect - so you could ignore them.

I would process them through a reader and then set up an SHA hash for them, so that any changes can be detected immediately next time.


尝试通过这些链接?

http://labs.appligent.com/presentations/recognizing_malformed_pdf_f.pdf [ ^ ]

http://arstechnica.com/civis/viewtopic.php?f=15&t=1134073 [ ^ ]

http://forums.techarena.in/tips-tweaks/1187473.htm [ ^ ]

http://answers.yahoo.com/question/index?qid=20110711001205AAFYri8 [ ^ ]
Tried going through these links?
http://labs.appligent.com/presentations/recognizing_malformed_pdf_f.pdf[^]
http://arstechnica.com/civis/viewtopic.php?f=15&t=1134073[^]
http://forums.techarena.in/tips-tweaks/1187473.htm[^]
http://answers.yahoo.com/question/index?qid=20110711001205AAFYri8[^]




对于仍在寻求解决arindamrudra问题的人来说,应该看一下这个免费的,开源的小程序,叫做'递归查找损坏的PDF文件'(下载链接: http://sourceforge.net/projects/corruptedpdfinder/ [ ^ ])这样做:在一个内容中找到递归损坏或受密码保护的PDF文件用户选择的文件夹。



祝你好运。

CSilva。
Hi,
For anyone still seeking a solution to arindamrudra problem should take a look at this free, open source and small program called 'Recursive finder of corrupted PDF files' (download link: http://sourceforge.net/projects/corruptedpdfinder/[^]) which will do just that: find recursively corrupted or password protected PDF files within a folder of a user's selection.

Good luck.
CSilva.


这篇关于查找损坏的.pdf文件的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆