最佳OCR算法 [英] best ocr algorithm

查看:212
本文介绍了最佳OCR算法的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

什么是用于通过移动设备从图像提取文本的最佳ocr算法
camera?

what is the best ocr algorithm used to extract text from image by mobile
camera?

推荐答案

这取决于您的应用程序要求.

通常,OCR引擎只能返回字符,字体,单词,行或区域信息.您需要添加许多其他模块并对其进行调整以获得最佳结果.

首先,图像预处理对于相机OCR应用来说是必需的.背景,噪声消除,二值化,调整大小...应尽可能添加到干净的图像中.

其次,您需要找到文本区域.该算法不仅取决于图像特征,还取决于您的OCR物镜.例如,如果您需要OCR板号,则可以使用板号的长度,宽度或高度来获得更准确的位置.

在OCR之后,您可以添加一些后期处理以更正某些OCR错误.常见的OCR引擎,例如 Abyy ExperVision ,Omnipage ,GOCR,Tesseract均根据通用文件,杂志或办公用纸进行培训.如果您的文档很特殊,则可以在OCR层上建立高级数据提取或分析逻辑.
That depends on your application requirements.

Usually OCR engine can only return characters, font, word, line or region information. You need add many other modules and tune them to get the best results.

First, image preprocessing is necessary for camera OCR application. Background, noise removal, binarization, resize ... should be added to clean image as possible as you can.

Secondly, you need locate the text region. The algorithm not only depends on image features but also on your OCR objective. For example, if you need to OCR plate number, you can use the plate number''s length, width or height for more accurate location.

After OCR, you can add some post processing to correct some OCR errors. Common OCR engines such as Abyy, ExperVision, Omnipage, GOCR, Tesseract are all trained according to common documents, magazine or office paper. If your documents are special, you can establish your high-level data extraction or analysis logic over OCR layer.


这篇关于最佳OCR算法的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆