OCR处理之前的图像预处理 [英] Image Preprocessing before OCR process

查看:901
本文介绍了OCR处理之前的图像预处理的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我目前的项目涉及将pdf中的文本转录为文本文件,我首先尝试将图像文件直接放入OCR程序(tesseract),但它没有做得那么好。
原始图像文件基本上都是旧的新闻报道,并且有一些背景噪音,我相信tesseract有问题。因此,我尝试使用一些图像预处理,然后将其输入tesseract。是否有任何建议适合这种情况的开源图像预处理引擎???关于如何使用它的说明将更加受到赞赏!

My current project involves transcribing texts in pdf into text files, and I first tried putting the image file directly into OCR program (tesseract) and it didnt' do that well. The original image files are old news papers, basically, and have some background noises, which I am sure tesseract has problem with. So I am trying to use some image preprocessing before feeding it into tesseract. Is there any suggestion for open source image preprocessing engine that fits well to this situation??? And instructions on how to use it would be even more appreciated !

推荐答案

我从未听说过图像预处理引擎为此目的,但您可以查看 OpenCV (开源计算机视觉库)和实现自己的预处理引擎 OpenCV 是一个计算机视觉库,提供许多功能来执行图像处理。

I never heard of an "image preprocessing engine" for that purpose, but you can take a look at OpenCV (Open Source Computer Vision Library) and implement your own "pre-processing engine". OpenCV is a computer vision library that offers many features to perform image processing.

您可能需要测试的一个有趣的事情是预处理步骤,将阈值应用于图像以消除噪音和东西。无论如何,我在此主题中讨论了这类内容

One interesting thing you might want test as a preprocessing step is apply a threshold to the image to remove noises and stuff. Anyway, I've talked about this kind of stuff in this thread.

这篇关于OCR处理之前的图像预处理的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆