扫描的文档中的背景/前景层分离 [英] Separation of background/foreground layers in a scanned document

查看:160
本文介绍了扫描的文档中的背景/前景层分离的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我需要自动删除OCR扫描文档图像的轻微彩色背景。

I need to automatically remove the mildly colored background of a scanned document image for OCR.

ScanTailor 是一个开源C ++基于GUI的应用程序,并除其他事项外背景分离,但我不能弄清楚如何只运行最后一步,实际上消除了背景。

ScanTailor is an open source C++ GUI-based app that does background separation among other things, but I cannot figure out how to run only the last step which actually removes the background.

在理想情况下,我能找到code,这是否与之一:

Ideally, I could find the code that does this and either:


  1. 端口部分为C#

  2. 修改C ++来响应命令行执行,只是执行给定的图像上步

你能帮助我理解我该怎么办要么?结果
或者你知道,能做到这一点其他图书馆? (任何语言/平台接受)

Can you help me understand how I can do either?
or do you know other libraries that can do this? (any language/platform acceptable)

推荐答案

您指的阈值,去噪及噪声消除技术,这是必要的OCR应用程序。

You are referring to Thresholding, Despeckling and Noise Removal techniques which are necessary in OCR applications.

结果的质量很大程度上取决于一个许多不同的因素 -

The quality of the results depends very much an many different factors -

打印原有的质量
扫描质量
图像分辨率
背景颜色和图案的使用。
噪声和其他标记。

Print quality of the original Scan quality Image resolution Background colours and patterns used. Noise and other marks.

您可能会发现在 http://www.hi-components.com的IEvolution.NET库/nievolution.asp 有用。它有许多图像处理功能一起玩。

You may find the IEvolution.NET library at http://www.hi-components.com/nievolution.asp useful. It has many image processing functions to play with.

有许多可用的商用发动机。还有就是要解决图像处理问题,没有一个完善的功能。你必须适应的功能和参数,以配合您的图像。 http://www.recogniform.com/thresholding.htm

There are many commercial engines available. There is no one perfect function to solve image processing problems. You must adapt the functions and parameter to match your images. http://www.recogniform.com/thresholding.htm

  • Best threshold for converting grayscale to black and white
  • Adaptive threshold binarization: post-processing for removing ghost objects.
  • Adaptive threshold Binarization's bad effects
  • fast threshold and bit packing algorithm ( possible improvements ? )

一个谷歌搜索会显示很多结果。

A Google search will show up lots of results.

这篇关于扫描的文档中的背景/前景层分离的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆