查找损坏的.pdf文件 [英] To find the corrupted .pdf files
本文介绍了查找损坏的.pdf文件的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!
问题描述
大家好,
我有大约116222 .pdf文件。其中我需要找出损坏的文件。任何人都可以告诉我是否有任何软件(免费或付费)来获取那些被破坏的文件,反之亦然。我google了很多但找不到任何东西。所有结果都显示了修复软件。
任何建议对我都非常有帮助。
Hi All,
I have near about 116222 .pdf files. Among them I need to find out the corrupted files. Can any one please tell me is there any software (free or paid) to get those files which are corrupted or vice versa. I googled a lot but could not find any. All the result showing the fixing software.
Any suggestion will be very much helpful for me.
推荐答案
问题在于判断文件是否已损坏。
如果您没有每个文件的SHA哈希值,或类似的东西,那么唯一可以判断文件是否已损坏的方法是尝试将其作为PDF文件读取 - 如果不能,则表明它已损坏,或者使用读者软件的PDF规范的更高版本。
如果你能阅读它们,那么它们可能并没有腐败 - 你需要一个人来阅读它们并确保它们看起来像我应该怀疑的那样 - 所以你可以忽略它们。
我会通过阅读器处理它们,然后为它们设置一个SHA哈希值,以便下次可以立即检测到任何更改。
The problem is in deciding if the file is "corrupted".
If you don't have a SHA hash value for each file, or something similar, then the only way you can tell if the file is corrupted is to try to read it as a PDF file - if you can't then it is either corrupt, or uses a later version of the PDF specification that your reader software.
If you can read them, then they probably aren't corrupt - you would need a human to reader them and ensure they look as they should I suspect - so you could ignore them.
I would process them through a reader and then set up an SHA hash for them, so that any changes can be detected immediately next time.
尝试通过这些链接?
http://labs.appligent.com/presentations/recognizing_malformed_pdf_f.pdf [ ^ ]
http://arstechnica.com/civis/viewtopic.php?f=15&t=1134073 [ ^ ]
http://forums.techarena.in/tips-tweaks/1187473.htm [ ^ ]
http://answers.yahoo.com/question/index?qid=20110711001205AAFYri8 [ ^ ]
Tried going through these links?
http://labs.appligent.com/presentations/recognizing_malformed_pdf_f.pdf[^]
http://arstechnica.com/civis/viewtopic.php?f=15&t=1134073[^]
http://forums.techarena.in/tips-tweaks/1187473.htm[^]
http://answers.yahoo.com/question/index?qid=20110711001205AAFYri8[^]
对于仍在寻求解决arindamrudra问题的人来说,应该看一下这个免费的,开源的小程序,叫做'递归查找损坏的PDF文件'(下载链接: http://sourceforge.net/projects/corruptedpdfinder/ [ ^ ])这样做:在一个内容中找到递归损坏或受密码保护的PDF文件用户选择的文件夹。
祝你好运。
CSilva。
Hi,
For anyone still seeking a solution to arindamrudra problem should take a look at this free, open source and small program called 'Recursive finder of corrupted PDF files' (download link: http://sourceforge.net/projects/corruptedpdfinder/[^]) which will do just that: find recursively corrupted or password protected PDF files within a folder of a user's selection.
Good luck.
CSilva.
这篇关于查找损坏的.pdf文件的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!
查看全文