有什么方法可以改善小字体的tesseract OCR? [英] Is there any way to improve tesseract OCR with small fonts?

查看：105 发布时间：2020/5/19 19:24:36 ocr tesseract python-imaging-library

本文介绍了有什么方法可以改善小字体的tesseract OCR?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在尝试通过python-tesseract使用tesseract-OCR来读取低分辨率字体，如下所示:

I'm trying to use tesseract-OCR via python-tesseract to read a low resolution font that looks like this:

不幸的是，该图像返回了

Unfortunately that image returns

ZIJZHZI

我认为分辨率太低，这会引起问题.我已经尝试过放大图像，并将其裁剪为单个字符，但是这些都不能提供很大的改进.还有什么我应该考虑做的事情，最好是可以使用Python Imaging Library完成的事情?或者我应该放弃/训练tesseract.

I think the resolution is too low and that is causing problems. I've tried magnifying the image, and cropping it down to individual characters, but neither of these provide much improvement. Is there anything else I should consider doing, preferably something that could be done using the Python Imaging Library? Or should I just give up/train tesseract.

对于它的价值，PIL具有以下内置过滤器:

For what it's worth, the PIL has the following built in filters:

蓝色，轮廓，细节，边缘增强，
EDGE_ENHANCE_MORE，EMBOSS，FIND_EDGES，
SMOOTH，SMOOTH_MORE和SHARPEN

BLUR, CONTOUR, DETAIL, EDGE_ENHANCE,
EDGE_ENHANCE_MORE, EMBOSS, FIND_EDGES,
SMOOTH, SMOOTH_MORE, and SHARPEN

有什么方法可以改善小字体的tesseract OCR? [英] Is there any way to improve tesseract OCR with small fonts?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

有什么方法可以改善小字体的tesseract OCR? [英] Is there any way to improve tesseract OCR with small fonts?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭