scikit IterativeImputer 中每列的 max_value 和 min_value [英] max_value and min_value for each column in scikit IterativeImputer

查看：60 发布时间：2021/7/16 20:19:53 python pandas scikit-learn sklearn-pandas imputation

本文介绍了scikit IterativeImputer 中每列的 max_value 和 min_value的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我有这个包含 78 列和 5707 行的数据集.几乎每一列都有缺失值，我想用 IterativeImputer 来估算它们.如果我理解正确，它将根据其他列的信息对每一列进行更智能"的插补.

I have this data set with 78 columns and 5707 rows. Almost every column has missing values and I would like to impute them with IterativeImputer. If I understood it correctly, it will make a "smarter" imputation on each column based on the information from other columns.

但是，在插补时，我不希望插补值小于观察到的最小值或大于观察到的最大值.我意识到有 max_value 和 min_value 参数，但我不想对插补施加全局"限制，相反，我希望每列都有自己的 max_value 和 min_value(这是已经观察到的最大值和最小值).因为否则，列中的值没有意义(人数为负值，比率为负值等)

However, when imputing, I do not want the imputed values to be less than the observed minimum or more than the observed maximum. I realize there are max_value and min_value parameters, but I do not want to impose a "global" limit to the imputations, instead, I want each column to have its own max_value and min_value (which is the already observed maximum and minimum values). Because otherwise, the values in the columns do not make sense (negative values for headcounts, negative values for rates, etc.)

有办法实现吗?

scikit IterativeImputer 中每列的 max_value 和 min_value [英] max_value and min_value for each column in scikit IterativeImputer

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

scikit IterativeImputer 中每列的 max_value 和 min_value [英] max_value and min_value for each column in scikit IterativeImputer

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭