eclipse上的JAVA OCR API开源 [英] JAVA OCR API open-source on eclipse

查看:949
本文介绍了eclipse上的JAVA OCR API开源的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我不明白OCR的任何东西。我只需要(甚至可能)得到.jar导入到Eclipse,我可以给出一个图像(.bmp,.tiff,.jpeg)作为输入,并输出一个字符串与图像的文本内容。这不是验证码或手写笔记,只是一些简单的照片的一些字母。



开源解决方案欣赏。我看到像Tess4j或JavaOCR这样的项目,但是它们有点复杂:(



任何人都有建议?



更新:我发现AspriseOCR非常容易使用,只需将.jar导入到构建路径并调用方法即可,但是...我无法在自己的项目中使用它!知道任何图书馆简单的Asprise(我不需要编译任何东西,或有任何类型的培训,只是导入,就是它!)

解决方案

据我所知,没有本机的开源Java OCR SDK,有Java API可以包含本机接口的调用,例如,最流行的开源OCR引擎之一--Tesseract( http://groups.google.com/group/tesseract-ocr/ ) - 有一些Java包装器像tesjeract( http://code.google.com/p/tesjeract/ )或Tess4J(<啊ref =http://tess4j.sf.net/ =noreferrer> http://tess4j.sf.net/ )。这可能适用于您,但设置起来相当困难,需要在您身边开发图像预处理和字体培训。



另一个解决方案可能是云服务。它需要最终用户应用程序来连接互联网,但它与您的编程语言选择和资源限制无关。看看 http://ocrsdk.com ,它是一个基于云的OCR SDK,可让您通过网络上传图像API并返回OCRed数据。这个基于Web API的OCR SDK不是免费的,这可能不适合您,但我仍然建议您尝试(免费试用),因为它的定价与企业解决方案相比是真正实惠的,同时它提供企业级OCR的准确度比开源更好。您还可以在github找到有用的 Java代码示例。 Disclamer:我在@ ABBYY工作。


I don't understand anything of OCR stuff. I just need (if it's even possible) to get a .jar to import to Eclipse which I can give an image (.bmp, .tiff, .jpeg) for input and it outputs me a string with the text content of the image. It's not captchas or handwritting, just a simple "photo" of some letters.

Open-source solutions are appreciated. I saw projects like Tess4j or JavaOCR but they are a bit complicated to handle :(

Anyone has a suggestion?

Update: I've found that AspriseOCR is very easy to use. Just import the .jar to the build path and call a method. Quite good! But... I can't use it on my own projects! Do you know any library as simple as Asprise (I don't need to compile anything, or have any type of training: just import and that's it!!)

解决方案

As far as i know there are no native opensource Java OCR SDKs. There are Java APIs which wrap calls for native interfaces, for example, for one of the most popular opensource OCR engines - Tesseract (http://groups.google.com/group/tesseract-ocr/) - there are some Java wrappers like tesjeract (http://code.google.com/p/tesjeract/) or Tess4J (http://tess4j.sf.net/). That could work for you, but it's rather hard to set up and will require developing image-preprocessing and font training on your side.

One more solution could be a cloud service. It requires end-user application to have the internet connection, but it's independent from your programming language choice and resources limitations. Have a look at http://ocrsdk.com, it's a cloud-based OCR SDK that let you upload an image through web API and returns you the OCRed data. This Web API based OCR SDK is not free, which may not be suitable for you, but i still recommend you try it out (it has a free trial) as its pricing is really affordable in comparison with enterprise solutions while it provides enterprise-level OCR accuracy which is way better than open source. You may also find useful this Java codesample at github. Disclamer: i work @ ABBYY.

这篇关于eclipse上的JAVA OCR API开源的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆