How to extract info from scikits.learn classifier to then use in C code


Problem description

I have trained a bunch of RBF SVMs using scikits.learn in Python and then Pickled the results. These are for image processing tasks and one thing I want to do for testing is run each classifier on every pixel of some test images. That is, extract the feature vector from a window centered on pixel (i,j), run each classifier on that feature vector, and then move on to the next pixel and repeat. This is far too slow to do with Python.

Clarification: When I say "this is far too slow..." I mean that even the under-the-hood Libsvm code that scikits.learn uses is too slow. I'm actually writing a manual decision function for the GPU so that classification at each pixel happens in parallel.

Is it possible for me to load the classifiers with Pickle, and then grab some kind of attribute that describes how the decision is computed from the feature vector, and then pass that info to my own C code? In the case of linear SVMs, I could just extract the weight vector and bias vector and add those as inputs to a C function. But what is the equivalent thing to do for RBF classifiers, and how do I get that info from the scikits.learn object?
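
For the linear case above, a minimal sketch of what that extraction might look like (the pickle file name is hypothetical, and this assumes a binary classifier):

    import pickle
    import numpy as np

    # Hypothetical file name; assumes the classifier was pickled as described above.
    with open("linear_svm.pkl", "rb") as f:
        clf = pickle.load(f)

    w = np.asarray(clf.coef_).ravel()   # weight vector (binary case)
    b = float(clf.intercept_[0])        # bias term

    def linear_decision(v):
        # Positive value => class 1, otherwise class 0.
        return np.dot(w, v) + b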

Added: First attempts at a solution.

It looks like the classifier object has the attribute support_vectors_, which contains the support vectors as the rows of an array. There is also the attribute dual_coef_, which is a 1 by len(support_vectors_) array of coefficients. From the standard tutorials on non-linear SVMs, it appears that one should do the following (a numpy sketch of these steps follows the list):

  • Compute the feature vector v from your data point under test. This will be a vector that is the same length as the rows of support_vectors_.
  • For each row i in support_vectors_, compute the squared Euclidean distance d[i] between that support vector and v.
  • Compute t[i] as exp{-gamma * d[i]}, where gamma is the RBF parameter.
  • Sum up dual_coef_[i] * t[i] over all i. Add the value of the intercept_ attribute of the scikits.learn classifier to this sum.
  • If the sum is positive, classify as 1. Otherwise, classify as 0.
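
Here is a minimal numpy sketch of those steps, which can be useful for checking the extracted attributes against the library's own decision_function before porting the loop to C (the pickle file name is hypothetical, and gamma is assumed to have been passed as an explicit number when the SVM was fitted):

    import pickle
    import numpy as np

    with open("rbf_svm.pkl", "rb") as f:   # hypothetical file name
        clf = pickle.load(f)

    sv = clf.support_vectors_              # one support vector per row
    alpha = clf.dual_coef_.ravel()         # signed dual coefficients, one per support vector
    b = float(clf.intercept_[0])           # bias term
    gamma = clf.gamma                      # assumes an explicit numeric gamma was used

    def rbf_decision(v):
        d = np.sum((sv - v) ** 2, axis=1)  # squared Euclidean distances to each support vector
        t = np.exp(-gamma * d)             # RBF kernel values
        return np.dot(alpha, t) + b        # positive => class 1

    # Sanity check against the library's own implementation, e.g.:
    # x = some_feature_vector
    # assert np.allclose(rbf_decision(x), clf.decision_function(x.reshape(1, -1)))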

Added: Page 9 of this documentation link mentions that the intercept_ attribute of the classifier does indeed hold the bias term. I have updated the steps above to reflect this.

Recommended answer

Yes, your solution looks alright. To pass the raw memory of a numpy array directly to a C program you can use the ctypes helpers from numpy, or wrap your C program with cython and call it directly by passing the numpy array (see the docs at http://cython.org for more details).
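
As a rough illustration of the ctypes route (the shared library name and the C function signature here are hypothetical; this assumes something like double rbf_decision(const double *v, int n) has been compiled into libsvm_gpu.so):

    import ctypes
    import numpy as np
    from numpy.ctypeslib import load_library, ndpointer

    # Hypothetical shared library exposing: double rbf_decision(const double *v, int n);
    lib = load_library("libsvm_gpu", ".")
    lib.rbf_decision.argtypes = [
        ndpointer(dtype=np.float64, ndim=1, flags="C_CONTIGUOUS"),
        ctypes.c_int,
    ]
    lib.rbf_decision.restype = ctypes.c_double

    v = np.ascontiguousarray(np.random.rand(128))  # stand-in for a real feature vector
    score = lib.rbf_decision(v, v.size)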

However, I am not sure that trying to speed up the prediction on a GPU is the easiest approach: kernel support vector machines are known to be slow at prediction time since their complexity depends directly on the number of support vectors, which can be high for highly non-linear (multi-modal) problems.

Alternative approaches that are faster at prediction time include neural networks (probably more complicated or slower to train right than SVMs, which only have 2 hyper-parameters, C and gamma) or transforming your data with a non-linear transformation based on distances to prototypes + thresholding + max pooling over image areas (only for image classification).

For the second approach, read the recent papers by Adam Coates and have a look at this page on kmeans feature extraction.
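
A very rough sketch of that prototype-distance encoding (this assumes the "triangle" activation described by Coates et al.; the patch size, number of centroids, and random data here are made-up stand-ins):

    import numpy as np
    from sklearn.cluster import KMeans  # assumes a modern sklearn install

    # Learn prototypes (centroids) from randomly sampled image patches.
    patches = np.random.rand(10000, 64)  # stand-in for real 8x8 patches
    centroids = KMeans(n_clusters=100).fit(patches).cluster_centers_

    def encode(patch):
        # Non-linear transform: distances to prototypes + soft thresholding.
        z = np.linalg.norm(centroids - patch, axis=1)
        return np.maximum(0.0, z.mean() - z)

    # Max pooling would then take the element-wise maximum of these codes
    # over all patches falling in each region (e.g. quadrant) of the image.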

Finally, you can also try NuSVC models, whose regularization parameter nu has a direct impact on the number of support vectors in the fitted model: fewer support vectors mean faster prediction times (check the accuracy though; it will be a trade-off between prediction speed and accuracy in the end).
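
For reference, a minimal sketch of fitting a NuSVC (the import path assumes a modern sklearn install; in scikits.learn it lived under scikits.learn.svm, and the nu and gamma values here are just examples):

    import numpy as np
    from sklearn.svm import NuSVC

    X_train = np.random.rand(200, 16)             # stand-in training features
    y_train = (X_train[:, 0] > 0.5).astype(int)   # stand-in labels

    # nu upper-bounds the fraction of margin errors and lower-bounds the fraction
    # of support vectors, so a smaller nu generally means fewer support vectors
    # and faster prediction, at a possible cost in accuracy.
    clf = NuSVC(nu=0.1, kernel="rbf", gamma=0.5).fit(X_train, y_train)
    print(clf.support_vectors_.shape[0])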
