应用程序文本(不扫描,不是验证码) [英] ocr'ing application text (not scanned, NOT captchas)

查看:176
本文介绍了应用程序文本(不扫描,不是验证码)的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述



在某些应用程序中,当Windows没有进行任何字体平滑处理时,我已经取得了成功通过手动输入一个短语,在所有的窗口字体中显示,找到一个匹配,从那里我可以通过生成字体中的所有字母来映射每个字母图像到一个字母。



如果正在完成任何字体平滑处理,无论是由Windows还是由应用程序完成,这都不起作用。在OCR计算机生成的文本中,艺术的状态如何?它似乎应该比打破​​CAPTCHA或OCRing扫描文本更容易。我在哪里可以找到这方面的资源?到目前为止,我只找到关于CAPTCHA破解或OCR扫描文本的文章。



我更喜欢从Python轻松访问的解决方案,但是如果其他语言中有很好的解决方案,我将不会确定你是什么意思,但我想只是阅读带有OCR程序的文本可以很好地工作。

Tesseract对于扫描的文档来说是非常精确的,所以一个特定的字体可以轻而易举地阅读。这是我的Python OCR解决方案: Linux中的Python OCR模块?。 / p>

但是,您可以将每个角色生成为图像并在图像上找到位置。它(可能)工作,但我不知道平滑如何准确。


I'd like to interface an application by reading the text it displays.

I've had success in some applications when windows isn't doing any font smoothing by typing in a phrase manually, rendering it in all windows fonts, and finding a match - from there I can map each letter image to a letter by generating all letters in the font.

This won't work if any font smoothing is being done, though, either by Windows or by the application. What's the state of the art like in OCRing computer-generated text? It seems like it should be easier than breaking CAPTCHAs or OCRing scanned text. Where can I find resources about this? So far I've only found articles on CAPTCHA breaking or OCRing scanned text.

I prefer solutions easily accessible from Python, though if there's a good one in some other lang I'll do the work to interface it.

解决方案

I'm not exactly sure what you mean, but I think just reading the text with an OCR program would work well.

Tesseract is amazingly accurate for scanned documents, so a specific font would be a breeze for it to read. Here's my Python OCR solution: Python OCR Module in Linux?.

But you could generate each character as an image and find the locations on the image. It (might) work, but I have no idea how accurate it would be with smoothing.

这篇关于应用程序文本(不扫描,不是验证码)的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆