随机森林回归中样本的大小 [英] Size of sample in Random Forest Regression

查看:200
本文介绍了随机森林回归中样本的大小的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

如果理解正确,则在计算随机森林估计量时通常会应用自举法,这意味着仅使用样本(i)中的数据(替换选择)来构建树(i).我想知道sklearn RandomForestRegressor 使用.

If understand correctly, when Random Forest estimators are calculated usually bootstrapping is applied, which means that a tree(i) is built only using data from sample(i), chosen with replacement. I want to know what is the size of the sample that sklearn RandomForestRegressor uses.

我唯一看到的是接近的东西:

The only thing that I see that is close:

bootstrap : boolean, optional (default=True)
    Whether bootstrap samples are used when building trees.

但是无法指定样本量的大小或比例,也无法告诉我默认样本量.

But there is no way to specify the size or proportion of the sample size, nor does it tell me about the default sample size.

我觉得应该有办法至少知道默认的样本量是多少,我想念的是什么?

I feel like there should be way to at least know what the default sample size is, what am I missing?

推荐答案

引导程序的样本大小始终是样本数.

The sample size for bootstrap is always the number of samples.

你没有遗漏任何东西,同样的问题在 邮件列表:

You are not missing anything, the same question was asked on the mailing list for RandomForestClassifier:

引导程序样本大小始终与输入样本大小相同.如果您愿意的话,可能会很欢迎更新文档的请求请求.

The bootstrap sample size is always the same as the input sample size. If you feel up to it, a pull request updating the documentation would probably be quite welcome.

这篇关于随机森林回归中样本的大小的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆