合并经过训练的数据文件-Tesseract [英] Merge trained data files - Tesseract
问题描述
我正在tesseract中使用两个训练有素的数据文件,以便识别两种语言.但是由于准确性不够,我训练了tesseract并生成了一个新的训练数据文件,我想将其与我使用的两种语言文件之一合并.所以我的问题是:怎么可能将新的训练数据文件与以下文件之一合并:
I'm using two traineddata files in tesseract in order to recognize two languages. But because the accuracy wasn't good enough, I trained tesseract and produce a new traineddata file which I want to merge it with one of the two language files I use. So my question is: How can it be possible to merge the new traineddata file with one of the files that is found here: https://code.google.com/p/tesseract-ocr/downloads/list .Any help?
推荐答案
您可以解压缩现有的.traineddata
并分别合并各个组件;但是,我不确定这是否行得通.您可以创建ell1.traineddata
并在命令行中将其与现有的一起指定,例如:
You can unpack the existing .traineddata
and merge the components separately; however, I'm not sure that's going to work. You can create your ell1.traineddata
and specify it together with the existing one at the command line, such as:
tesseract image output -l ell+ell1
这篇关于合并经过训练的数据文件-Tesseract的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!