I want to read text from an image file


Problem description

Please give me an answer for this.

Recommended answer

Pick up the image, use your eyes, and read.

Not hard.

An image file, say a .bmp, is not encoded as text, so OCR will be required.

You will need to look for a third-party API that can scan images and perform OCR.
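
As one concrete option, the open-source Tesseract engine ships a command-line tool that can be called from a script. The answer does not name a specific product, so Tesseract itself and the helper names `build_command` and `read_text` are assumptions made for illustration. This sketch also assumes the `tesseract` binary is installed and on the PATH.

```python
import shutil
import subprocess


def build_command(image_path, lang="eng"):
    """Build the Tesseract command line.

    Passing 'stdout' as the output base makes Tesseract print the
    recognized text instead of writing it to a file.
    """
    return ["tesseract", image_path, "stdout", "-l", lang]


def read_text(image_path, lang="eng"):
    """Run Tesseract on an image file and return the recognized text."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    result = subprocess.run(
        build_command(image_path, lang),
        capture_output=True,  # collect the recognized text from stdout
        text=True,            # decode the output as str rather than bytes
        check=True,           # raise CalledProcessError if Tesseract fails
    )
    return result.stdout
```

With Tesseract installed, `read_text("scan.png")` (a hypothetical file name) would return whatever text the engine recognizes in that image. Accuracy depends heavily on image quality.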

If you still want to do this yourself (and it is not a trivial job), you will need to read up on how OCR works and then come up with an algorithm for it.
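
To give a feel for what such an algorithm might look like, here is a deliberately tiny sketch of template matching, one of the simplest recognition ideas: compare a character's pixel grid against stored reference grids and pick the closest one. The 3x5 glyph shapes and all function names are made up for illustration. Real OCR also has to locate lines and characters, cope with fonts and scale, and much more.

```python
# Hypothetical 3x5 reference glyphs: '#' is ink, '.' is background.
TEMPLATES = {
    "I": ["###", ".#.", ".#.", ".#.", "###"],
    "L": ["#..", "#..", "#..", "#..", "###"],
    "T": ["###", ".#.", ".#.", ".#.", ".#."],
}


def distance(glyph, template):
    """Count the pixels where two equally sized grids differ (Hamming distance)."""
    return sum(
        g != t
        for glyph_row, template_row in zip(glyph, template)
        for g, t in zip(glyph_row, template_row)
    )


def recognize(glyph):
    """Return the template character whose grid differs least from the glyph."""
    return min(TEMPLATES, key=lambda ch: distance(glyph, TEMPLATES[ch]))


def read_line(glyphs):
    """Recognize a sequence of already-segmented glyphs as a string."""
    return "".join(recognize(g) for g in glyphs)
```

Because the match is "closest" rather than "identical", a glyph with a stray pixel can still be recognized, but the approach breaks down as soon as the font, size, or alignment differs from the stored templates.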

You see, it is not just about extracting text; it is about converting what you see on paper into an actual text representation.

Very difficult.




Hi,

You need some sort of character recognition code. Try searching this site for an article, or Google for "character recognition", "optical character recognition", or "OCR" code.

Hope that helps. :)


Um ... load the image into Paint, or some other image-processing program, and look at the picture on the screen? :-\

No, of course, that's not the answer you're looking for.

What you need is OCR (optical character recognition) software. I recommend Googling for third-party software that will do the job; it'll be far more reliable than attempting to write your own. Many, many years of research have gone into creating effective OCR, so unless you want to do it for the sake of learning how it's done, there's absolutely no sense in doing it yourself.

If you really must do it yourself, think about the task: do you want to recognize handwriting or print? If print, what font(s)? If handwriting, are you interested in cursive (joined letters) or block printing? What about different font weights? (Strokes in a bold font are typically many pixels wider than in a normal font.) Do you want to insist that the text is on a plain background, or are you interested in handling noise (such as may be present in a fax, or a photograph of something with writing on it)?
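
The background and noise questions matter because recognition usually starts by separating ink from background. Below is a minimal sketch of that preprocessing step, assuming the image is already available as a grid of grayscale values (0 = black, 255 = white). The threshold of 128 and the helper names `binarize` and `remove_specks` are arbitrary choices for illustration, not recommended settings.

```python
def binarize(gray, threshold=128):
    """Turn grayscale rows (0-255) into rows of 1 (ink) and 0 (background)."""
    return [[1 if value < threshold else 0 for value in row] for row in gray]


def remove_specks(binary):
    """Clear ink pixels that have no ink among their 4 direct neighbours."""
    height = len(binary)
    width = len(binary[0]) if height else 0
    cleaned = [row[:] for row in binary]  # copy so the input is not modified
    for y in range(height):
        for x in range(width):
            if not binary[y][x]:
                continue
            neighbours = [(y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)]
            has_ink_neighbour = any(
                0 <= ny < height and 0 <= nx < width and binary[ny][nx]
                for ny, nx in neighbours
            )
            if not has_ink_neighbour:
                cleaned[y][x] = 0  # isolated pixel: treat it as noise
    return cleaned
```

A filter this crude would also erase legitimate single-pixel marks, such as a tiny period, which hints at why handling noisy input is one of the harder parts of the problem.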

