分类任务中的所有二元预测变量 [英] All binary predictors in a classification task

查看:148
本文介绍了分类任务中的所有二元预测变量的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我正在使用R进行分析,我将实现四种算法.

I am performing my analysis using R, I will be implementing four algorithms.

1. RF
2. Log Reg
3. SVM
4. LDA

我有50个预测变量和1个目标变量.我所有的预测变量和目标变量都是二进制数字0s和1s.

I have 50 predictors and 1 target variable. All my predictors and target variable are only binary numbers 0s and 1s.

我有以下问题:

Should I convert them all into factors?
Converting them into factors, and applying RF algorithms give 100% accuracy, I am very much surprised to see that as well.
Also, for other algorithms, how should i treat my variables priorly, before feeding them into my other algorithms.

谢谢

推荐答案

使用adaboost ...

Use adaboost...

研究一下不同的kaggle内核,尤其是梅赛德斯(Mercedes)内核,以实现实现adaboost的想法.

Take a look at different kaggle kernels, especially the Mercedes one, to get the idea of implementing adaboost.

https://www.kaggle.com/c/mercedes -benz-greener-manufacturing/内核

数据集包含数值和因子以及0s,1s.

The dataset is mixed of both numerical and factors and 0s,1s.

这篇关于分类任务中的所有二元预测变量的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆