使用Tesseract进行手写识别 [英] Using Tesseract for handwriting recognition

查看:455
本文介绍了使用Tesseract进行手写识别的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我只是想知道,如果将tesseract的笔迹以小写形式全部用在自己的小盒子中,那么手写识别的准确性如何?

I was just wondering how accurate can tesseract be for handwriting recognition if used with capital letters all in their own little boxes in a form.

我知道您可以训练它以某种方式识别您自己的笔迹,但就我而言,问题是我需要在多个笔迹中使用它.谁能指出我正确的方向?

I know you can train it to recognise your own handwriting somewhat but the problem in my case is I need to use it across multiple handwritings. Can anyone point me in the right direction?

非常感谢.

推荐答案

简而言之,您将必须训练Tesseract引擎来识别笔迹.看一下此链接:

In short, you would have to train the Tesseract engine to recognize the handwriting. Take a look at this link:

带字典训练的Tesseract手写

这是链接帖子所说的:

可以训练tesseract识别笔迹.这是 说明: https://tesseract-ocr.github.io/tessdoc/Training-Tesseract

It's possible to train tesseract to recognize handwriting. Here are the instructions: https://tesseract-ocr.github.io/tessdoc/Training-Tesseract

但是不要期望效果很好.学者通常已经 准确度结果最高可达90%.这里有一些参考 用于单词和数字.因此,如果您的用例可以处理至少1/10 错误,这可能对您有用.

But don't expect very good results. Academics have typically gotten accuracy results topping out about 90%. Here are a couple references for words and numbers. So if your use case can deal with at least 1/10 errors, this might work for you.

这是关于这个主题的一篇很好的学术文章:

Also here is a good academic article written on this subject:

使用Tesseract识别手写文字注释 即时信息(iJIT)的开源OCR引擎

Recognition of Handwritten Textual Annotations using Tesseract Open Source OCR Engine for information Just In Time (iJIT)

这篇关于使用Tesseract进行手写识别的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆