逐行字符提取 [英] Row by Row character extraction

查看:155
本文介绍了逐行字符提取的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我正在从输入图像开始手写字符识别。以下是从输入图像中提取字符的代码

I am working on handwritten character recognition from input image. Here is the code which extracts characters from input image

  %% Label connected components
  [L Ne]=bwlabel(Ifill);
  disp(Ne);
  %% Measure properties of image regions
  propied=regionprops(L,'BoundingBox');
  hold on

  %% Plot Bounding Box
  for n=1:size(propied,1)
  rectangle('Position',propied(n).BoundingBox,'EdgeColor','g','LineWidth',2)
  end
  hold off

  %% Characters being Extracted
  figure
  for n=1:Ne
  [r,c] = find(L==n);
  n1=imagen(min(r):max(r),min(c):max(c));
  imshow(~n1);
  end

但此代码是从输入图像中随机提取字符。任何人都可以告诉我如何逐行提取字符?

But this code is extracting characters randomly from the input image. Can anyone please tell me how to extract the characters row by row?

原始代码

推荐答案

假设:字符行之间有足够的分隔,这样至少有一个图像行在两行字符之间完全空白。

Assumption: There is enough separation between rows of characters, such that there is at least one image row that is completely blank between two rows of characters.

代码

%%// Input binary image
BW = Ifill;

%%// Label connected blobs
[L,NUM] = bwlabel(BW);

%%// Find centroid points for each blob
cc = bwconncomp(BW);
stats = regionprops(cc, 'Centroid');
cent_rowcol = vertcat(stats.Centroid);

%%// Find transitions of characters starts and ends along the rows
trans1 = [0 ;diff(sum(BW,2)>0)]; 

%%// Find row midpoints for each row
row_midpts = (find(trans1==1) + find(trans1==-1))/2;

%%// Find the new labels based on row by row sorting
[~,row_id] = min(abs(bsxfun(@minus,cent_rowcol(:,2),row_midpts')),[],2); %%//'
[~,sortedInd] = sort((row_id-1)*size(BW,1)+cent_rowcol(:,1));

%%// Assign the new labels
L1 = zeros(size(L));
for k=1:numel(sortedInd)
    L1(L==sortedInd(k))=k;
end

%%// Testing: Show all the characters one by one row by row
figure,
for k=1:numel(sortedInd)
    imshow(L1==k);
    pause(0.2);
end

如何使用输出,L1:你可以使用 L1 == 1 获得第一个字符,第二个使用 L1 == 2 等等,如下所示代码末尾的 Testing 部分。

How to use the output, L1: You can get the first character with L1==1, second with L1==2 and so on, as shown in the Testing section at the end of code.

这篇关于逐行字符提取的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆