如何在Ubuntu/Linux发行版中安装Tesseract-OCR 3.03? [英] How does one install Tesseract-OCR 3.03 in Ubuntu/Linux distributions?

查看:284
本文介绍了如何在Ubuntu/Linux发行版中安装Tesseract-OCR 3.03?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我和一个朋友对培训用于CV项目的tesseract-OCR引擎感兴趣.我们尝试使用一些包装器(例如PyTesser和pyocr),但结果目前还不如我们需要的准确.因此,我们想尝试训练Tesseract以更好地实现我们的目的(即识别食品标签上的文字),但是在安装训练工具时遇到了一些麻烦.

A friend and I are interested in training the tesseract-OCR engine for a CV project. We tried using some wrappers such as PyTesser and pyocr, but the results are currently not as accurate as we need them to be. As such, we want to try training the tesseract to perform better for our purposes (i.e. identifying text on food labels), but are having some trouble installing the training tools.

我们尝试过的事情:

在Google代码网站上,查看tesseract的Google代码Wiki上的编译"页面说培训工具仅在3.03版上可用.但是,针对tesseract-ocr的Google代码下载"页面仅包含3.02的材料. 编译"页面的底部还包含有关在Windows和OSX上安装3.03版的一些注释,但对于Linux用户尚未有任何注释.

Looking on the google code website, the 'Compiling' page on the tesseract's google code wiki says the training tools are only available on version 3.03. However, the google code 'Downloads' page for tesseract-ocr only has the materials for 3.02. The bottom of the 'Compiling' page also has some comments about installing version 3.03 on Windows and OSX, but no comments yet for Linux users.

似乎还存在某种 Ubuntu 3.03源码包,但我们不知道如何在计算机上访问它,并且编译"页面显示我们需要运行以下命令:

There also appears to be some sort of 3.03 source package for Ubuntu but we're not sure how to access it on our computers and the 'Compiling' page says we need to run these commands:

make training
sudo make training-install

我们还找到了有关tesseract 3.03的 google群组线程似乎这些帖子未包含针对Linux用户的建议(除非我们在初次阅读时错过了一些内容).

We've also found a google group thread about tesseract 3.03 but again it seems like these posts do not include advice for Linux users (unless we missed something during the initial read).

这实际上是一个非常简单的命令行安装问题吗?或者,有没有一种使用3.02(我们当前已安装)的tesseract火车的方式?我们是否在错误的地方寻找信息?

Is this actually a really simple command-line install problem? Or, is there a way train tesseract with 3.02 (which we currently have installed)? Have we been looking at the wrong places for information?

任何为Linux发行版安装tesseract-ocr 3.03的建议或说明链接将不胜感激!谢谢.

Any advice or links to instructions for installing tesseract-ocr 3.03 for Linux distributions would be greatly appreciated! Thanks.

推荐答案

Tesseract可以使用

Tesseract can directly be installed in Ubuntu 14.04 using

sudo apt-get install tesseract-ocr

我不知道您是否可以在较旧版本的Ubuntu中执行此操作,因为该回购协议可能会在较新版本的Ubuntu中进行更新.

I don't have any idea if you can do it in older version of Ubuntu because the repo might be updated in later version of Ubuntu.

这篇关于如何在Ubuntu/Linux发行版中安装Tesseract-OCR 3.03?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆