How to set a cutoff while training the data in Random Forest in Spark

Views: 31 | Published: 2021/11/14 21:12:35 | Tags: apache-spark, random-forest, apache-spark-mllib


Problem description

I am using Spark MLlib to train a classification model with the Random Forest algorithm. MLlib provides a RandomForest class whose trainClassifier method does the training.

Can I set a threshold value while training on the data set, similar to the cutoff option provided in R's randomForest package?

http://cran.r-project.org/web/packages/randomForest/randomForest.pdf
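For readers unfamiliar with the R option: as documented for randomForest, cutoff is a vector with one value per class, and the winning class is the one with the largest ratio of vote proportion to cutoff (the default is 1/k for k classes). The following is a minimal plain-Python sketch of that rule, not Spark code. The function name `predict_with_cutoff` and the `tree_votes` input are hypothetical, and the sketch assumes the per-tree class predictions have already been obtained by some other means.

```python
from collections import Counter
from typing import Sequence


def predict_with_cutoff(tree_votes: Sequence[int], cutoff: Sequence[float]) -> int:
    """Pick the class with the largest (vote share / cutoff) ratio.

    tree_votes: one predicted class index per tree (0 .. k-1); hypothetical input.
    cutoff: one positive value per class; R's default is 1/k for each class.
    """
    if not tree_votes:
        raise ValueError("tree_votes must not be empty")
    if any(c <= 0 for c in cutoff):
        raise ValueError("cutoff values must be positive")
    counts = Counter(tree_votes)
    n_trees = len(tree_votes)
    # Vote proportion of each class divided by that class's cutoff.
    ratios = [counts.get(k, 0) / n_trees / cutoff[k] for k in range(len(cutoff))]
    # Ties resolve to the lowest class index in this sketch.
    return max(range(len(cutoff)), key=lambda k: ratios[k])


votes = [0, 0, 0, 0, 0, 0, 1, 1, 1, 1]  # 6 trees vote class 0, 4 vote class 1
default_pred = predict_with_cutoff(votes, [0.5, 0.5])  # plain majority vote
skewed_pred = predict_with_cutoff(votes, [0.7, 0.3])   # class 1 needs fewer votes
```

With equal cutoffs the rule reduces to a majority vote, while a lower cutoff for a class lets it win with a smaller share of the votes. This is the behavior the question is asking for.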

I found that MLlib's RandomForest class only provides options for the number of trees, impurity, number of classes, and so on; there is nothing like a threshold or cutoff option. Is there any way to do this?
