Image Preprocessing before OCR process

Views: 901 Published: 2018/7/30 17:02:52 Tags: image-processing ocr tesseract

Problem description

My current project involves transcribing text from PDFs into text files. I first tried feeding the image files directly into an OCR program (tesseract), but it didn't do that well. The original image files are basically old newspapers and have some background noise, which I am sure tesseract has a problem with. So I am trying to apply some image preprocessing before feeding them into tesseract. Are there any suggestions for an open-source image preprocessing engine that fits this situation well? Instructions on how to use it would be even more appreciated!
