Simple (working) handwritten digit recognition: how to improve it?

Views: 106 | Published: 2020/5/6 11:29:36 | Tags: python, math, ocr, data-analysis, svd


Problem description

I just wrote this very simple handwritten digit recognition program. [image of the test digit] is well recognized as [image of the matched digit].

In short, each digit of the database (50x50 pixels = 2500 coefficients) is summarized into a 10-coefficient vector (by keeping the 10 biggest singular values, see Low-rank approximation with SVD).
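The reduction from 2500 pixel values (50 x 50) to 10 coefficients can be sketched on its own, without the image files. This is a minimal illustration, not the code from the post: the helper names `top_singular_values` and `low_rank_approximation` are hypothetical, and a random array stands in for a digit image. The last lines check a known property of the truncated SVD: the Frobenius norm of the reconstruction error equals the norm of the discarded singular values.

```python
import numpy as np

def top_singular_values(image, k=10):
    """Return the k largest singular values of a 2-D array (the feature vector)."""
    s = np.linalg.svd(image, compute_uv=False)  # already sorted in descending order
    return s[:k]

def low_rank_approximation(image, k=10):
    """Rebuild the image from its k largest singular values and vectors."""
    U, s, Vh = np.linalg.svd(image, full_matrices=False)
    return (U[:, :k] * s[:k]) @ Vh[:k, :]

rng = np.random.default_rng(0)
image = rng.random((50, 50))            # synthetic stand-in for a 50x50 digit image

features = top_singular_values(image)   # 2500 pixel values -> 10 coefficients
approx = low_rank_approximation(image)  # rank-10 reconstruction, still 50x50

# Reconstruction error (Frobenius norm) vs. norm of the discarded singular values
s_all = np.linalg.svd(image, compute_uv=False)
error = np.linalg.norm(image - approx)
print(features.shape, np.isclose(error, np.linalg.norm(s_all[10:])))
```

Note that the 10-coefficient vector keeps only the singular values; the singular vectors, which carry the spatial layout of the strokes, are used for the reconstruction but not for the feature vector.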

Then, for the digit to be recognized, we pick the database digit that minimizes the distance to it.
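The distance-minimization step can be sketched in isolation. This is an illustrative sketch only: `nearest_digit` is a hypothetical helper name, and the short made-up vectors stand in for the 10 singular values of each database digit. It uses the squared Euclidean distance, which has the same minimizer as the plain Euclidean distance.

```python
import numpy as np

def nearest_digit(query, database):
    """Return the index of the database vector closest to `query`
    (squared Euclidean distance)."""
    distances = [np.sum((vector - query) ** 2) for vector in database]
    return int(np.argmin(distances))

# Made-up 3-coefficient vectors standing in for the singular values of each digit
database = [np.array([9.0, 4.0, 1.0]),
            np.array([7.0, 5.0, 2.0]),
            np.array([8.0, 1.0, 0.5])]
query = np.array([7.2, 4.6, 2.1])

print(nearest_digit(query, database))   # index of the best match in `database`
```

With one reference vector per digit, the index returned is also the recognized digit, provided the database is ordered from 0 to 9.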

import numpy as np
import matplotlib.pyplot as plt
from PIL import Image   # scipy.misc.imread was removed in SciPy 1.2; Pillow is used here instead

digits = []
for i in range(11):
    # load as a grayscale float array (what flatten=True did in the old scipy.misc.imread)
    M = np.asarray(Image.open(str(i) + '.png').convert('F'))
    U, s, Vh = np.linalg.svd(M, full_matrices=False)   # note: the third output is V transposed
    s[10:] = 0        # keep the 10 biggest singular values only, discard others
    S = np.diag(s)
    M_reduced = np.dot(U, np.dot(S, Vh))      # reconstitution of image with 10 biggest singular values
    digits.append({'original': M, 'singular': s[:10], 'reduced': M_reduced})

# each 50x50 pixels digit is summarized into a vector of 10 coefficients : the 10 biggest singular values s[:10]    

# 0.png to 9.png = all the digits (for machine training)
# 10.png = the digit to be recognized
toberecognizeddigit = digits[10]    
digits = digits[:10]

# we find the nearest neighbour by minimizing the squared distance between the singular values of toberecognizeddigit and those of each digit in the database
recognizeddigit = min(digits, key=lambda d: sum((d['singular']-toberecognizeddigit['singular'])**2))

plt.imshow(toberecognizeddigit['reduced'], interpolation='nearest', cmap=plt.cm.Greys_r)
plt.show()
plt.imshow(recognizeddigit['reduced'], interpolation='nearest', cmap=plt.cm.Greys_r)
plt.show()

Question:

The code works (you can run the code in the ZIP archive), but how can we improve it to get better results? (Mostly math techniques, I imagine.)

For example, in my tests, 9 and 3 are sometimes confused with each other.
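One property that may contribute to such confusions (an observation about the SVD in general, not something stated in the post): the singular values of an image do not change when it is flipped, mirrored, or transposed, so visually different shapes can share the same 10-coefficient vector. A minimal sketch on a synthetic array, where `singular_signature` is a hypothetical helper name:

```python
import numpy as np

def singular_signature(image, k=10):
    """The k largest singular values, i.e. the feature vector described above."""
    return np.linalg.svd(image, compute_uv=False)[:k]

rng = np.random.default_rng(1)
image = rng.random((50, 50))        # synthetic stand-in for a digit image

variants = {
    "flipped": image[::-1, :],      # upside-down copy
    "mirrored": image[:, ::-1],     # left-right mirrored copy
    "transposed": image.T,          # rows and columns swapped
}

for name, variant in variants.items():
    same = np.allclose(singular_signature(image), singular_signature(variant))
    print(name, "has the same singular values:", same)
```

Whether this is what actually causes the 9/3 confusion in the author's data cannot be determined from the post; the sketch only shows that singular values alone discard orientation information.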
