随机森林分类weka [英] Randomforest classification weka

查看:169
本文介绍了随机森林分类weka的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

属性已保存在csv文件的11列中.如果列的顺序发生变化,请执行Randomforest&随机树每次都能给出不同的准确性吗?

The attributes have been saved in 11 columns in csv file. If the order of columns change, Do Randomforest & RandomTree could give different accuracy in each time?

推荐答案

特征的排序不会影响我所知道的任何分类器(除非是专门为此设计的分类器,例如针对时间序列和其他时间特征的特殊分类器),无论是神经网络,SVM,RandomForest,RandomTree还是NaiveBayes-只是数字简化,因为数组比集合快,而在幕后"则将它们视为无序集合(仅显示从它来自哪个维度).

Ordering of the features does not affect any of classifiers I know (except those which are specially designed to do so - like specialistic classifiers for time series and other temporal features), no matter if it is Neural Network, SVM, RandomForest, RandomTree or NaiveBayes - it is just a numerical simplification, as arrays are faster then sets, while "under the hood" they are treated as unordered sets (only with indicies showing from which dimension it comes from).

由于其概率/随机学习方法,每次运行代码时,特定分类器的输出都会发生变化.例如-神经网络具有随机初始化,RandomForests具有随机​​子采样等.

What can change is the output of the particular classifier each time you run your code due to its probabilistic/stochastic methods of learning. For example - neural networks have random initializations, RandomForests has random subsampling etc.

因此答案令人惊讶地是是的,它可以在列顺序更改后更改",但是原因不是顺序更改,而是事实上,这样做之后,内部随机数生成器已经经过了一些周期并会生成不同的数字.

So answer is suprisingly "yes, it can change after order of columns change", but the reason for this is not the change in order, but fact, that after you do so, the internal random number generator already passed some cycles and will generate different numbers.

这篇关于随机森林分类weka的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆