Strength of Dictionary in Tesseract 3

Question

How do I increase or decrease the strength of the dictionary in Tesseract 3?

The FAQ says I need to change the values of "NON_WERD" and "GARBAGE_STRING", but they do not exist in Tesseract 3.

Recommended answer

According to http://code.google.com/p/tesseract-ocr/wiki/FAQ, you can change these variables:

enable_new_segsearch    1
language_model_penalty_non_freq_dict_word 0.2
language_model_penalty_non_dict_word 0.3

Increase the two penalty values to make Tesseract more biased toward dictionary words; decrease them to weaken the bias.

Note: You must set enable_new_segsearch, otherwise the penalty variables will have no effect.
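
One way to apply these variables is to put them in a config file and pass that file as the last argument to the tesseract command. The following is a minimal Python sketch of that approach, not an official recipe: it assumes the tesseract executable is on PATH, and the names dict_strength.cfg, render_config, build_command, and run_ocr are hypothetical, chosen only for illustration. The values are the ones quoted above.

```python
import subprocess
from pathlib import Path

# Values copied from the answer above. Raising the two penalties should
# bias recognition more strongly toward dictionary words.
DICT_PARAMS = {
    "enable_new_segsearch": "1",
    "language_model_penalty_non_freq_dict_word": "0.2",
    "language_model_penalty_non_dict_word": "0.3",
}


def render_config(params):
    """Return config-file text with one 'name value' pair per line."""
    return "".join(f"{name} {value}\n" for name, value in params.items())


def build_command(image, output_base, config_path, lang="eng"):
    """Build the tesseract argument list; the config file goes last."""
    return ["tesseract", str(image), str(output_base), "-l", lang, str(config_path)]


def run_ocr(image, output_base, params=DICT_PARAMS, config_name="dict_strength.cfg"):
    """Write the config file (hypothetical name) and invoke tesseract with it."""
    config_path = Path(config_name)
    config_path.write_text(render_config(params))
    subprocess.run(build_command(image, output_base, config_path), check=True)
```

With this in place, a call such as run_ocr("page.png", "page_out") would write the config file and run tesseract on a (hypothetical) page.png. To experiment with the dictionary strength, pass a modified copy of DICT_PARAMS with higher or lower penalty values and compare the recognized text.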
