在Python中使用mca包 [英] Using mca package in Python

查看：303 发布时间：2020/5/24 1:56:43 python-3.x pandas scikit-learn pca

本文介绍了在Python中使用mca包的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在尝试使用 mca软件包 Python中的多重对应分析.

I am trying to use the mca package to do multiple correspondence analysis in Python.

我对如何使用它有些困惑.使用PCA，我希望拟合一些数据(即找到那些数据的主要成分)，然后我将能够使用我发现的主要成分进行转换看不见的数据.

I am a bit confused as to how to use it. With PCA I would expect to fit some data (i.e. find principal components for those data) and then later I would be able to use the principal components that I found to transform unseen data.

根据MCA文档，我无法确定最后一步的操作方法.我也不明白任何奇怪地用名字命名的属性和方法(例如.E，.L，.K，.k等)的作用.

Based on the MCA documentation, I cannot work out how to do this last step. I also don't understand what any of the weirdly cryptically named properties and methods do (i.e. .E, .L, .K, .k etc).

到目前为止，如果我的DataFrame的列中包含字符串(假定这是DF中的唯一列)，我会做类似的事情

So far if I have a DataFrame with a column containing strings (assume this is the only column in the DF) I would do something like

import mca
ca = mca.MCA(pd.get_dummies(df, drop_first=True))

从我能收集到的东西

ca.fs_r(1)

是df和

ca.L

应该是特征值(尽管我得到的1 s向量比我的特征数少一个元素?).

is supposed to be the eigenvalues (although I get a vector of 1s that is one element fewer that my number of features?).

现在，如果我还有更多具有相同功能的数据，假设为df_new，并假设我已将其正确转换为虚拟变量，那么如何为新数据找到与ca.fs_r(1)等效的数据

now if I had some more data with the same features, let's say df_new and assuming I've already converted this correctly to dummy variables, how do I find the equivalent of ca.fs_r(1) for the new data

在Python中使用mca包 [英] Using mca package in Python

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

在Python中使用mca包 [英] Using mca package in Python

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭