如何从图像生成Tiff/Box文件以在Windows中训练Tesseract [英] How to generate a tiff/box file from an image to train Tesseract in Windows

查看:587
本文介绍了如何从图像生成Tiff/Box文件以在Windows中训练Tesseract的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我正在尝试在Windows中训练Tesseract,为此,我需要一个成对的tiff/box文件,并尝试使用jTessBoxEditor创建它,但它不接受图像作为输入.我也尝试过boxFactory,但无法正常运行.有谁知道从图像创建配对的最佳工具是什么?

I'm trying to train Tesseract in Windows and for that I need a pair tiff/box file and I'm trying to create it using jTessBoxEditor but it doesn't accept images as input. I've also tried boxFactory but it doesn't run properly. Does anyone know what is the best tool to create the pair from images?

谢谢

推荐答案

如果您具有jTessBoxEditor,则您具有Tesseract bin文件.转到jTessBoxEditor的 tesseract-ocr 子文件夹,然后运行以下命令:

If you have jTessBoxEditor, then you have Tesseract bin files. Go to the tesseract-ocr subfolder of jTessBoxEditor and run the following command :

tesseract.exe D:\ testocr \ TestImage.tif D:\ testocr \ TestImage batch.nochop makebox

tesseract.exe D:\testocr\TestImage.tif D:\testocr\TestImage batch.nochop makebox

它应该生成文件 D:\ testocr \ TestImage.box . 然后在jTessBoxEditor中,转到框编辑器"选项卡并打开图像.盒子文件会自动加载,您可以检查一切是否正常,并纠正可能的错误.

It should generate the file D:\testocr\TestImage.box. Then in jTessBoxEditor, go to Box Editor tab and open your image. The box file is automatically loaded, you can check if everything is ok and correct possible mistakes.

这篇关于如何从图像生成Tiff/Box文件以在Windows中训练Tesseract的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆