Pdf来词翻译dll [英] Pdf To word coversion dll
本文介绍了Pdf来词翻译dll的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!
问题描述
我想要一个免费的完整版pdf到单词转换dll ..
I want a free full version pdf to word conversion dll..
推荐答案
你可能想要这样的事情。但你不会得到任何。
PDF是一种文档格式,但它并不是真正的格式。它是一种特殊的封装PostScript文件,旨在成为打印标准。因此,PDF文件被构建为在打印设备上呈现,而不是作为文本编辑。你可能会发现一个单词在文件中有字母作为独立对象在语义上与任何形式的彼此无关。
所以你唯一的机会是:
1)将PDF渲染成图像
2)使用OCR工具处理图像
3)将结果转换为单词。
如果你想要开源/免费解决方案,你可能会找到一些独立的工具来完成这些步骤。
但如果你没有时间和/或者让这些组件协同工作的知识,我建议你寻找涵盖所有步骤的商业解决方案。我建议你看看这个: http://www.abbyy.com/ocr_sdk_windows/technical_specifications/ [ ^ ]
You might want such thing. But you won't get any.
A PDF is a "document" format, but it isn't really one. It is a special encapsulated PostScript file, which is meant to be a printing standard. Thus a PDF file is built to be rendered on a printing device, not to be edited as text. You might find a word having it's letters in the file as independent objects semantically not related to each-other in any form.
So your only chance is to:
1) render the PDF to an image
2) process the image with an OCR tool
3) convert the result to word.
If you want open source / free solutions you might find some independent tools for these steps.
But if you don't have the time and/or the knowledge to get those components working together, I suggest you look for a commercial solution covering all steps. I suggest you look at this one: http://www.abbyy.com/ocr_sdk_windows/technical_specifications/[^]
这篇关于Pdf来词翻译dll的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!
查看全文