Pandas 0.25.0:分类分组 [英] Pandas 0.25.0: groupby on categoricals

查看：64 发布时间：2021/6/14 18:32:45 pandas pandas-groupby

本文介绍了Pandas 0.25.0:分类分组的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我在使用上个月发布的 Pandas 0.25.0 时遇到了一些困难.

I have some difficulties on using Pandas 0.25.0, which is released last month.

考虑这个日期框架:

df = pd.DataFrame({
    'A': pd.Series(['a', 'b', 'b', 'a'], dtype='category'),
    'B': pd.Series(['m', 'o', 'o', 'o']),
    'C': pd.Series([1, 2, 3, 4]),
})

假设我们要对前两列进行分组.结果数据框应该包含 3 行，因为组合 b m 不存在.

Say we want to groupby on the first two columns. The resulting data frame should contain 3 rows, since the combination b m doesn't exist.

df.groupby(['A', 'B']).agg({'C': 'sum'})

在 Pandas 0.24.1 及更早版本中，这可以正常工作:

In Pandas 0.24.1 and earlier, this works fine:

     C
A B   
a m  1
  o  4
b o  5

然而，在 Pandas 0.25.0 中这被破坏了:

However, in Pandas 0.25.0 this is broken:

       C
A B     
a m  1.0
  o  4.0
b m  NaN
  o  5.0

我知道我可以通过将 observed=True 添加到 groupby 调用来抑制这种不需要的行为，但这在旧版本中不是必需的.我在发行说明中找不到任何相关内容.

I know I can suppress this unwanted behaviour by adding observed=True to the groupby call, but that was not neccessary in the old version. I cannot find anything related in the release notes.

怎么会?这是熊猫中的错误吗?我错过了什么吗?

How come? Is this a bug in pandas? Did I miss something?

Pandas 0.25.0:分类分组 [英] Pandas 0.25.0: groupby on categoricals

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

Pandas 0.25.0:分类分组 [英] Pandas 0.25.0: groupby on categoricals

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭