Tesseract OCR 如何改善结果? [英] Tesseract OCR How do I improve result?

查看:58
本文介绍了Tesseract OCR 如何改善结果?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我在使用 Tesseract 时遇到了困难,有没有办法提高准确性?如果需要,我如何为自己训练?

I am having a hard time working with Tesseract, is there a way to improve the accuracy? How do I train it for myself, if needed?

我唯一要做的就是阅读以下字符,XYZ:-0123456789就是这样!图片总是那样.

the only thing I am doing is reading the following characters, XYZ:-0123456789 that's it! The pictures always look that way.

谢谢!

推荐答案

Tesseract 4.00alpha 与您的图像的输出是

The output of Tesseract 4.00alpha with your image is

$ tesseract ICKcj.png - -l eng
*: 4606 Y; 4809 Z; 698

Warning. Invalid resolution 0 dpi. Using 70 instead.

将图片重新采样为 50% 并将 dpi 设置为 300:

Resample the picture to 50% and setting the dpi to 300:

这个图像的输出稍微好一点,警告消失了:

The output with this image is slightly better and the warning is vanishing:

$ tesseract ICKcj-50.png - -l eng
X: 4606 Y: 4809 Z: 698

唯一缺少的是减号,它们打印的非常不规则(图片中更好的分辨率可能会有所帮助).也可以在 tesseract 中限制输出模式.或者,您可以尝试根据 X、Y、Z 和数字之间的空格来猜测减号.

The only thing missing are the minus signs, which are printed quite irregular (a better resolution in the picture could help). It is also possible to restrict the output pattern in tesseract. Alternatively, you can try to guess the minus afterwards depending on the spaces between the X, Y, Z and the numbers.

这篇关于Tesseract OCR 如何改善结果?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆