逐行字符提取 [英] Row by Row character extraction
问题描述
我正在从输入图像开始手写字符识别。以下是从输入图像中提取字符的代码
I am working on handwritten character recognition from input image. Here is the code which extracts characters from input image
%% Label connected components
[L Ne]=bwlabel(Ifill);
disp(Ne);
%% Measure properties of image regions
propied=regionprops(L,'BoundingBox');
hold on
%% Plot Bounding Box
for n=1:size(propied,1)
rectangle('Position',propied(n).BoundingBox,'EdgeColor','g','LineWidth',2)
end
hold off
%% Characters being Extracted
figure
for n=1:Ne
[r,c] = find(L==n);
n1=imagen(min(r):max(r),min(c):max(c));
imshow(~n1);
end
但此代码是从输入图像中随机提取字符。任何人都可以告诉我如何逐行提取字符?
But this code is extracting characters randomly from the input image. Can anyone please tell me how to extract the characters row by row?
推荐答案
假设:字符行之间有足够的分隔,这样至少有一个图像行在两行字符之间完全空白。
Assumption: There is enough separation between rows of characters, such that there is at least one image row that is completely blank between two rows of characters.
代码
%%// Input binary image
BW = Ifill;
%%// Label connected blobs
[L,NUM] = bwlabel(BW);
%%// Find centroid points for each blob
cc = bwconncomp(BW);
stats = regionprops(cc, 'Centroid');
cent_rowcol = vertcat(stats.Centroid);
%%// Find transitions of characters starts and ends along the rows
trans1 = [0 ;diff(sum(BW,2)>0)];
%%// Find row midpoints for each row
row_midpts = (find(trans1==1) + find(trans1==-1))/2;
%%// Find the new labels based on row by row sorting
[~,row_id] = min(abs(bsxfun(@minus,cent_rowcol(:,2),row_midpts')),[],2); %%//'
[~,sortedInd] = sort((row_id-1)*size(BW,1)+cent_rowcol(:,1));
%%// Assign the new labels
L1 = zeros(size(L));
for k=1:numel(sortedInd)
L1(L==sortedInd(k))=k;
end
%%// Testing: Show all the characters one by one row by row
figure,
for k=1:numel(sortedInd)
imshow(L1==k);
pause(0.2);
end
如何使用输出,L1:你可以使用 L1 == 1
获得第一个字符,第二个使用 L1 == 2
等等,如下所示代码末尾的 Testing 部分。
How to use the output, L1: You can get the first character with L1==1
, second with L1==2
and so on, as shown in the Testing section at the end of code.
这篇关于逐行字符提取的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!