测试示例集属性应等于训练示例集 Rapidminer SVM 的 OR Superset [英] Test example set attributes should be equal to OR Superset of Training example set Rapidminer SVM

查看:30
本文介绍了测试示例集属性应等于训练示例集 Rapidminer SVM 的 OR Superset的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我是 Rapid Miner 的新手,并在其中使用了 SVM Linear.我的模型是:


我制作了训练示例集,其中包含 3552 个示例和仅 2 个属性,我正在进行名义到数字的转换,通过通过 SVM 线性模型,然后在应用模型中连接模型输出.这可以.

在测试示例集中,我有 735 个示例,它们具有 2 个属性并进行名义到数字的转换,然后将此转换后的示例集应用于应用模型.在此阶段,我在运行该过程时收到错误消息,内容如下:

I am new to Rapid Miner and using SVM Linear in it. My model is as:


I made Training Example set which consist of 3552 examples and just 2 attributes and I am doing nominal to numeric conversion, passing through SVM Linear model and then connecting model output in applying model. This is fine.

In Test Example set, I have 735 examples with 2 attributes and doing nominal to numeric conversion and then applying this converted Example set to Applying Model. At this stage I am getting an error when I run the process, which says that:

我对此进行了大量搜索,但没有找到正确的方向.我将感谢您的帮助.

I searched a lot about this but did not get the right direction. I will be thankful for your help.

推荐答案

Nominal to Numeric 运算符将创建新属性,其名称将从输入属性的值中派生.当 dummy encoding 用于 coding type 参数时会发生这种情况.如果与训练数据相比,测试数据包含不同的值,则结果属性将不同.

The Nominal to Numeric operator will make new attributes whose names will be derived from the values of the input attributes. This happens when dummy encoding is used for the coding type parameter. If the test data contains different values when compared to the training data then the resulting attributes will be different.

要确认这是问题所在,请在 Nominal to Numeric 运算符之后设置断点并检查每个示例集的属性.

To confirm this is the problem, set a breakpoint after the Nominal to Numeric operators and examine the attributes of each example set.

您可以通过将参数设置为 unique integers 来更改运算符的工作方式,但这可能不适合您要解决的问题.

You can change how the operator works by setting the parameter to unique integers but this might not suit the problem you are trying to solve.

一种可能的解决方法是合并两个数据集,然后再次拆分它们.这具有为每个名义属性创建允许级别的效果,即使数据可能没有值的示例.然后每个拆分都可以与 Nominal to Numeric 运算符一起使用,它应该创建所有必需的属性.

One possible way to solve it is to combine the two data sets then split them again. This has the effect of creating allowed levels for each nominal attribute even though the data may not have an example of the value. Each split can then be used with the Nominal to Numeric operator and it should create all the required attributes.

这篇关于测试示例集属性应等于训练示例集 Rapidminer SVM 的 OR Superset的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆