Why do I get good accuracy with the IRIS dataset with a single hidden node?


Problem description

I have a minimal example of a neural network with a back-propagation trainer, and I am testing it on the IRIS data set. I started off with 7 hidden nodes and it worked well.

I lowered the number of nodes in the hidden layer to 1, expecting it to fail, but was surprised to see that the accuracy went up.

I set up the same experiment in Azure ML, just to validate that it wasn't my code. Same thing there: 98.3333% accuracy with a single hidden node.

Can anyone explain to me what is happening here?

Recommended answer

First, it is well established that a variety of classification models yield extremely good results on Iris (Iris is very predictable); see here, for example.

Second, there are relatively few features in the Iris dataset. Moreover, if you look at the dataset description, you can see that two of the features are very highly correlated with the class outcomes.

These correlation values are linear, single-feature correlations, which indicates that one can most likely apply a linear model and observe good results. Neural nets are highly nonlinear; they become more complex and capture more nonlinear feature combinations as the number of hidden nodes and hidden layers increases.

Taking these facts into account, (a) that there are few features to begin with and (b) that there are high linear correlations with class, everything points to a less complex, linear function being the appropriate predictive model. By using a single hidden node, you are very nearly using a linear model.
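To make the "very nearly a linear model" point concrete, here is a minimal sketch of a network with 4 inputs, one sigmoid hidden node and 3 softmax outputs. The weights are hand-picked for illustration, not trained, and the sample points are made up (they are not rows from the Iris dataset); the feature order sepal length, sepal width, petal length, petal width is also an assumption. The thing to notice is that the prediction depends on the input only through the single weighted sum `z`, so the three classes are separated by two parallel linear boundaries.

```python
import math

# Hypothetical, hand-picked weights (not trained): the hidden node looks only
# at petal length + petal width, assumed to be the last two features.
W_HIDDEN = [0.0, 0.0, 1.0, 1.0]
B_HIDDEN = -6.5
V_OUT = [-10.0, 0.0, 10.0]   # hidden -> output weights, one per class
C_OUT = [3.0, 0.0, -7.0]     # output biases, one per class


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def forward(x):
    """Return class probabilities for one sample of 4 features."""
    # The whole input is squeezed into one number: a linear projection.
    z = sum(w * xi for w, xi in zip(W_HIDDEN, x)) + B_HIDDEN
    h = sigmoid(z)
    # Every output score is a function of h alone.
    return softmax([v * h + c for v, c in zip(V_OUT, C_OUT)])


def predict(x):
    probs = forward(x)
    return max(range(len(probs)), key=probs.__getitem__)


# Made-up points with small, medium and large petals.
print(predict([5.0, 3.4, 1.5, 0.2]))  # small petals
print(predict([6.0, 2.9, 4.5, 1.5]))  # medium petals
print(predict([6.5, 3.0, 5.8, 2.2]))  # large petals

# Two different inputs with the same projection are indistinguishable
# to this network, whatever their other features are.
x_a = [5.0, 3.0, 4.0, 2.0]
x_b = [7.0, 2.5, 4.5, 1.5]
print(forward(x_a) == forward(x_b))
```

With these weights, the class changes only when `h` crosses 0.3 or 0.7, which corresponds to two thresholds on the same linear combination of the inputs. That is enough to separate three classes when they are ordered along one direction in feature space.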

It can also be noted that, in the absence of any hidden layer (i.e., just input and output nodes), a network that uses the logistic transfer function is equivalent to logistic regression.
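The equivalence can be sketched for the two-class case as follows. This is an illustrative example on a made-up one-feature toy set, assuming a cross-entropy loss and plain batch gradient descent; the learning rate and epoch count are arbitrary choices. The forward pass of the "network" is the logistic regression model, and the back-propagated gradient is the usual logistic regression gradient.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def predict_proba(x, w, b):
    # Input nodes wired straight to one logistic output node:
    # p = sigmoid(w . x + b), which is the logistic regression model.
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b)


def train(X, y, lr=0.5, epochs=5000):
    """Batch gradient descent on the mean cross-entropy loss."""
    w = [0.0] * len(X[0])
    b = 0.0
    n = len(X)
    for _ in range(epochs):
        grad_w = [0.0] * len(w)
        grad_b = 0.0
        for xi, yi in zip(X, y):
            # For sigmoid + cross-entropy, back-propagation gives
            # dLoss/dz = p - y, the same term as in logistic regression.
            err = predict_proba(xi, w, b) - yi
            for j, xij in enumerate(xi):
                grad_w[j] += err * xij / n
            grad_b += err / n
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


# Made-up, linearly separable toy data (one feature, two classes).
X = [[0.5], [1.0], [1.5], [3.0], [3.5], [4.0]]
y = [0, 0, 0, 1, 1, 1]

w, b = train(X, y)
preds = [1 if predict_proba(xi, w, b) >= 0.5 else 0 for xi in X]
print(preds)
```

For more than two classes, the same argument applies with a softmax output layer, which corresponds to multinomial logistic regression.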
