pandas GroupBy.apply方法复制第一个组 [英] Pandas GroupBy.apply method duplicates first group

查看：68 发布时间：2020/5/23 21:13:29 python pandas group-by pandas-groupby

本文介绍了 pandas GroupBy.apply方法复制第一个组的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我的第一个SO问题: 我对在熊猫(0.12.0-4)中groupby的apply方法的这种行为感到困惑，它似乎将TWICE函数应用于数据帧的第一行.例如:

My first SO question: I am confused about this behavior of apply method of groupby in pandas (0.12.0-4), it appears to apply the function TWICE to the first row of a data frame. For example:

>>> from pandas import Series, DataFrame
>>> import pandas as pd
>>> df = pd.DataFrame({'class': ['A', 'B', 'C'], 'count':[1,0,2]})
>>> print(df)
   class  count  
0     A      1  
1     B      0    
2     C      2

我首先检查groupby函数是否可以正常工作，这似乎还不错:

I first check that the groupby function works ok, and it seems to be fine:

>>> for group in df.groupby('class', group_keys = True):
>>>     print(group)
('A',   class  count
0     A      1)
('B',   class  count
1     B      0)
('C',   class  count
2     C      2)

然后我尝试对groupby对象应用apply来做类似的事情，并且两次获得第一行输出:

Then I try to do something similar using apply on the groupby object and I get the first row output twice:

>>> def checkit(group):
>>>     print(group)
>>> df.groupby('class', group_keys = True).apply(checkit)
  class  count
0     A      1
  class  count
0     A      1
  class  count
1     B      0
  class  count
2     C      2

任何帮助将不胜感激！谢谢.

Any help would be appreciated! Thanks.

@Jeff提供以下答案.我很忙，并没有立即理解它，因此，这是一个简单的示例，显示尽管上面的示例中第一组的两次打印输出，apply方法仅对第一组操作一次，并且不会改变原始数据帧:

@Jeff provides the answer below. I am dense and did not understand it immediately, so here is a simple example to show that despite the double printout of the first group in the example above, the apply method operates only once on the first group and does not mutate the original data frame:

>>> def addone(group):
>>>     group['count'] += 1
>>>     return group

>>> df.groupby('class', group_keys = True).apply(addone)
>>> print(df)

      class  count
0     A      1
1     B      0
2     C      2

但是通过将方法的返回值分配给新对象，我们看到它可以按预期工作:

But by assigning the return of the method to a new object, we see that it works as expected:

df2 = df.groupby('class'，group_keys = True).apply(addone) 打印(df2)

df2 = df.groupby('class', group_keys = True).apply(addone) print(df2)

      class  count
0     A      2
1     B      1
2     C      3

pandas GroupBy.apply方法复制第一个组 [英] Pandas GroupBy.apply method duplicates first group

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

pandas GroupBy.apply方法复制第一个组 [英] Pandas GroupBy.apply method duplicates first group

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭