阿拉伯语的开源OCR [英] Open Source OCR for Arabic

查看:826
本文介绍了阿拉伯语的开源OCR的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我正在寻找一个OCR库或任何我可以使用它来从图像中读取阿拉伯字母的样本。我搜索了很多并且没有找到任何东西..请帮助
提前感谢。

I'm looking for an OCR library or any samples that I could use it to read Arabic letters from an image.i googled a lot and didn't find any thing..please help thanks in advance.

推荐答案

TesseractOCR 可能是最好的开源OCR引擎,并且它可以识别的内容非常灵活。它允许使用自定义数据进行培训,因此只要您愿意投入工作(即创建训练集),基本上任何语言都是可能的。

TesseractOCR is the probably the best open source OCR engine out there and is very flexible as to what it can recognize. It allows for training with custom data, so essentially any language is possible as long as your willing to put in the work (i.e. create the training set).

tesseract提供的工具(使用gui界面)可以帮助您创建数据集,您可以在其中指定字符的边界框和相应的转录。

There are tools provided by tesseract (with a gui interface) that can help create the data set where you specify the bounding box of characters and the corresponding transcription.

编辑:从其他帖子(上面已链接)注意到已经为3.01版创建了阿拉伯语培训集。您只需插入阿拉伯数据即可解决问题:)。

Noticed from another post (linked above) that a training set on Arabic has already been created for version 3.01. You'd just need to plug in the Arabic data and your problem is solved :).

这篇关于阿拉伯语的开源OCR的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆