Cursive character segmentation in OCR


Question

I have built an OCR application for ordinary (non-cursive) handwritten characters. To segment the characters I used the histogram projection profile method, which works well for ordinary English characters.

I use horizontal projection for line segmentation and vertical projection for character segmentation.
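For readers unfamiliar with the projection profile method, here is a minimal sketch of the idea, not the asker's actual code. It assumes the page is already binarized into a list of rows with 1 for ink and 0 for background; the names `projection`, `runs_of_ink`, and `segment_characters` are illustrative only.

```python
from typing import List, Tuple

Image = List[List[int]]  # binary image: 1 = ink, 0 = background


def projection(image: Image, axis: str) -> List[int]:
    """Count ink pixels per row (axis='rows') or per column (axis='cols')."""
    if axis == "rows":
        return [sum(row) for row in image]
    if axis == "cols":
        return [sum(col) for col in zip(*image)]
    raise ValueError("axis must be 'rows' or 'cols'")


def runs_of_ink(profile: List[int]) -> List[Tuple[int, int]]:
    """Return half-open (start, end) index ranges where the profile is non-zero."""
    runs = []
    start = None
    for i, value in enumerate(profile):
        if value > 0 and start is None:
            start = i  # a new run of ink begins
        elif value == 0 and start is not None:
            runs.append((start, i))  # an empty row/column ends the run
            start = None
    if start is not None:
        runs.append((start, len(profile)))
    return runs


def segment_characters(image: Image) -> List[Tuple[int, int, int, int]]:
    """Return (top, bottom, left, right) boxes: split into lines, then characters."""
    boxes = []
    for top, bottom in runs_of_ink(projection(image, "rows")):
        line = image[top:bottom]
        for left, right in runs_of_ink(projection(line, "cols")):
            boxes.append((top, bottom, left, right))
    return boxes
```

This only splits where a whole row or column is empty, which is why it depends on characters being separated by blank columns.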

To segment the lines of a cursive handwritten article, I can use horizontal projection as before. But I can't use the same method to segment cursive English characters, because they are joined to each other and also slanted. Can anyone suggest a way to segment cursive characters?

Recommended answer

This is a difficult problem because of the variability between writers and character shapes. One option, which has achieved up to 83% accuracy, is to analyze the ligatures (the connections between characters) in the writing and draw columns on the image using those ligatures as base points. A 2013 paper in Procedia Computer Science proposed this approach and reported results on this particular problem: https://ac.els-cdn.com/S1877050913001464/1-s2.0-S1877050913001464-main.pdf?_tid=5f55eac2-0077-11e8-9d79-00000aacb35f&acdnat=1516737513_c5b6e8cb8184f69b2d10f84cd4975d56
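To make the idea of cutting at ligatures concrete, here is a rough sketch. It is not the algorithm from the linked paper; it is a simplified heuristic that assumes ligatures show up as columns containing very little ink (a single thin stroke), and it proposes one cut column in the middle of each such run. The function name `candidate_cut_columns` and the `max_ink` threshold are assumptions made for illustration, and the input is assumed to be a binarized image of a single word.

```python
from typing import List

Image = List[List[int]]  # binary word image: 1 = ink, 0 = background


def candidate_cut_columns(word: Image, max_ink: int = 1) -> List[int]:
    """Return one cut column per run of thin (ligature-like) columns.

    A column counts as thin when it holds between 1 and max_ink ink pixels.
    Runs touching the left or right edge of the word are skipped, since a
    cut there would not separate two characters.
    """
    profile = [sum(col) for col in zip(*word)]
    cuts = []
    start = None
    # The appended sentinel is never thin, so it closes a run at the right edge.
    for x, ink in enumerate(profile + [max_ink + 1]):
        thin = 1 <= ink <= max_ink
        if thin and start is None:
            start = x
        elif not thin and start is not None:
            end = x  # half-open run [start, end)
            if start > 0 and end < len(profile):
                cuts.append((start + end - 1) // 2)  # middle column of the run
            start = None
    return cuts
```

In practice the candidates produced by a heuristic like this over-segment letters such as "m" or "u", so they would normally be filtered or validated by a later step; slanted writing would also likely need deslanting first, since a vertical column cannot follow a slanted ligature.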

Another approach to try is skeletal analysis, which takes the word as a whole, matches its shape against other known word shapes, and predicts the word from the entire image.

Good luck!
