分数模型与交叉验证的分数数据集输出 [英] Scored datasets output from Score Model vs. Cross Validate

查看：151 发布时间：2019/6/19 1:57:36 MachineLearning

本文介绍了分数模型与交叉验证的分数数据集输出的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

你好

根据经验，最好在整个标记的数据集而不是子集上训练预测模型.我是在Python中以这种方式进行的.唯一的问题是评估模型.交叉验证是一种解决方法，因为该模型是在多个内部折叠中进行评估的，这些折叠被用作迭代测试数据集.我试图避免将数据集拆分为训练和测试子集，因为这将减少可用于训练模型的数据量.

As a rule of thumb, it is better to train a predictive model on an entire labelled dataset rather than a subset of it. I do it this way in Python. The only problem with that is in evaluating the model. Cross-validation is a way around that, as the model is evaluated over multiple, internal folds that are used as iterative test datasets. I seek to avoid Splitting the dataset into Train and Test subsets, as this will reduce the amount of data available to train the model.

我正在AzureML中构建一个模型，以制作一个Web服务，该服务将为未标记的数据集生成预测的标签.我正在大量标签数据上训练模型.

I am building a model in AzureML to make a Web Service that will generate predicted labels for an unlabeled dataset. I am training the model on a large set of labelled data.

在这种情况下，最好这样做:

In such a case, is it better to:

A.使用Tune Model Hyperparameters基于整个数据集(未提供验证数据集)生成经过训练的模型，将该模型输出到Cross Validate 由交叉验证生成，作为Web的最终输出服务，或

A. Use Tune Model Hyperparameters to generate a trained model based on an entire dataset (no validation dataset provided), output that model to Cross Validate along with the entire dataset, and use the Scored Dataset generated by Cross Validate as final output for the Web Service, or

B. ，将该模型连同整个数据集一起输出到得分模型，并且使用由评分模型

B. Use Tune Model Hyperparameters to generate a trained model based on an entire dataset (no validation dataset provided), output that model to Score Model along with the entire dataset, and use the Scored Dataset generated by Score Model as final output for the Web Service?

我最困惑的是交叉验证模块的计分数据集"输出.有没有人有使用该输出的任何示例?我从未见过.这意味着什么?

The main thing I am confused about is the Scored Dataset output of the Cross Validate module. Does anyone have any examples of that output being used? I have never seen one. What is it meant to be used for?

谢谢...

分数模型与交叉验证的分数数据集输出 [英] Scored datasets output from Score Model vs. Cross Validate

问题描述

推荐答案

相关文章

其他开发语言最新文章

热门教程

热门工具

登录关闭

分数模型与交叉验证的分数数据集输出 [英] Scored datasets output from Score Model vs. Cross Validate

问题描述

推荐答案

相关文章

其他开发语言最新文章

热门教程

热门工具

登录 关闭

登录关闭