Leave-one-out cross-validation


Question

I am trying to evaluate a multivariable dataset by leave-one-out cross-validation and then remove those samples not predictive of the original dataset (Benjamini-corrected, FDR > 10%).

Using the docs on cross-validation, I've found the leave-one-out iterator. However, when trying to get the score for the nth fold, an exception is raised saying that more than one sample is needed. Why does .predict() work while .score() doesn't? How can I get the score for a single sample? Do I need to use another approach?

Failing code:

from sklearn import ensemble, cross_validation, datasets

dataset = datasets.load_linnerud()
x, y = dataset.data, dataset.target
clf = ensemble.RandomForestRegressor(n_estimators=500)

loo = cross_validation.LeaveOneOut(x.shape[0])
for train_i, test_i in loo:
    score = clf.fit(x[train_i], y[train_i]).score(x[test_i], y[test_i])
    print('Sample %d score: %f' % (test_i[0], score))

The exception raised:

ValueError: r2_score can only be computed given more than one sample.


Edit:

I am not asking why this doesn't work, but for a different approach that does. After fitting/training my model, how do I test how well a single sample fits the trained model?

Answer

cross_validation.LeaveOneOut(x.shape[0]) creates as many folds as there are rows, so each validation run holds only a single test instance.

Now, to draw a "line" you need two points, whereas with your one instance you only have one point. That is what the error message says: it needs more than one instance (or sample) to draw the "line" used to compute the r² value.

Generally, in the ML world, people report 10-fold or 5-fold cross-validation results, so I would recommend setting n to 10 or 5 accordingly.
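As a sketch of that recommendation, assuming a recent scikit-learn release (where the old cross_validation module became model_selection, and KFold takes n_splits), 5-fold cross-validation on the same dataset might look like this; n_estimators is reduced here only to keep the sketch fast:

```python
from sklearn import datasets, ensemble
from sklearn.model_selection import KFold, cross_val_score

dataset = datasets.load_linnerud()
x, y = dataset.data, dataset.target  # 20 samples, 3 targets

clf = ensemble.RandomForestRegressor(n_estimators=50, random_state=0)

# Each of the 5 validation folds now holds 4 samples,
# so the r^2 score is well defined for every fold.
scores = cross_val_score(clf, x, y, cv=KFold(n_splits=5, shuffle=True, random_state=0))
print(scores)         # one r^2 score per fold
print(scores.mean())  # the usual single number people report
```

Averaging the per-fold scores gives the single cross-validated figure that is typically reported.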

After a quick discussion with @banana, we realized that the question was initially misunderstood. Since it is not possible to get an R² score for a single data point, an alternative is to calculate the distance between the actual and predicted points. This can be done using numpy.linalg.norm(clf.predict(x[test_i])[0] - y[test_i])
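A minimal sketch of that idea, a per-sample prediction error in place of a per-sample r², assuming the same data and model as in the question (modern scikit-learn API, and a smaller n_estimators only for speed):

```python
import numpy as np
from sklearn import datasets, ensemble
from sklearn.model_selection import LeaveOneOut

dataset = datasets.load_linnerud()
x, y = dataset.data, dataset.target
clf = ensemble.RandomForestRegressor(n_estimators=50, random_state=0)

distances = []
for train_i, test_i in LeaveOneOut().split(x):
    clf.fit(x[train_i], y[train_i])
    # Euclidean distance between the predicted and actual target vectors
    # for the single held-out sample; smaller means a better fit.
    distances.append(np.linalg.norm(clf.predict(x[test_i])[0] - y[test_i][0]))

for i, d in enumerate(distances):
    print('Sample %d distance: %f' % (i, d))
```

Samples with unusually large distances are the poorly predicted ones, which could then feed into the FDR-style filtering mentioned in the question.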

