Weka上的烟雾和欠采样的组合 [英] combination of smote and undersampling on weka

查看：194 发布时间：2020/10/2 3:22:49 dataset classification data-mining

本文介绍了Weka上的烟雾和欠采样的组合的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

根据chawla等人的论文（2002年），
平衡数据的最佳性能是将欠采样与SMOTE相结合。

according to paper which written by chawla, et al (2002) the best perfomance of balancing data is combining undersampling with SMOTE.

试图使用欠采样和SMOTE（
）组合我的数据集，但我对欠采样的属性有些困惑。

I’ve tried to combine my dataset using under-sampling and SMOTE, but I am bit confuse about the attribute for under-sampling.

在Weka中，减少多数阶层。
在Resample中有一个属性
biasToUniformClass-是否对统一类使用偏见。值为0会使类分布保持原样，值为1则确保输出数据中的类分布是均匀的。

In weka there is Resample to decrease the majority class. there is a attribute in Resample biasToUniformClass -- Whether to use bias towards a uniform class. A value of 0 leaves the class distribution as-is, a value of 1 ensures the class distribution is uniform in the output data.

我使用值0，而将多数类减少了，少数类也减少了，当我使用值1时，多数类的数据减少了，而少数类中的数据增加了。

I use value 0 and the data in majority class is down so the minority do and when I use value 1, the data in majority decrease but in minority class, the data is up.

我尝试使用值该属性为1，但我不使用smote来增加少数类的实例，因为数据已经平衡并且结果也很好。

I try to use value 1 for that attribute, but I don't using smote to increase the instances of minority class because the data is already balance and the result is good too.

所以，是就像我将SMOTE和欠采样合并在一起一样，还是我仍然必须尝试在该属性中使用值0并执行SMOTE吗？

so, is that the same as I combine the SMOTE and under-sampling or I still have to try with value 0 in that attribute and do the SMOTE ?

Weka上的烟雾和欠采样的组合 [英] combination of smote and undersampling on weka

问题描述

推荐答案

相关文章

AI人工智能最新文章

热门教程

热门工具

登录关闭

Weka上的烟雾和欠采样的组合 [英] combination of smote and undersampling on weka

问题描述

推荐答案

相关文章

AI人工智能最新文章

热门教程

热门工具

登录 关闭

登录关闭