Tesseract - 使用与主要 OCR 不同的图像格式进行训练 [英] Tesseract - train with different image format than used for primary OCR

查看：43 发布时间：2021/9/6 18:34:09 tesseract

本文介绍了Tesseract - 使用与主要 OCR 不同的图像格式进行训练的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

正如在这个 SO 问题中所讨论的，tesseract 通常与 .png 文件而不是 .tiff 文件.(我自己也直接经历过这一点).不幸的是，可以处理 .png 文件的框编辑器较少.因此，我很想使用 .tiff 文件训练我的数据，但随后将 .png 文件用于我的主要 OCR 工作.这样做会降低培训的效果吗?如果是这样，有什么方法可以解决它(除了找到一个可以接受 .png 文件的框编辑器)?

As discussed on this SO Question, tesseract often operates better with .png files than with .tiff files. (I have also experienced this directly myself). Unfortunately, there are fewer box editors available that can handle .png files. I therefore am tempted to train my data using .tiff files but then use .png files for my main OCR work. Will doing so reduce the effectiveness of the training? If so, are there any ways to address it (other than just finding a box editor that can accept .png files)?

Tesseract - 使用与主要 OCR 不同的图像格式进行训练 [英] Tesseract - train with different image format than used for primary OCR

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

Tesseract - 使用与主要 OCR 不同的图像格式进行训练 [英] Tesseract - train with different image format than used for primary OCR

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭