通过 pandas 数据框中的列中的重复值进行汇总 [英] Aggregate by repeated values in a column in a data frame in pandas

查看：38 发布时间：2020/5/24 4:21:54 python pandas

本文介绍了通过 pandas 数据框中的列中的重复值进行汇总的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我有一个数据框，如下所示:

I have a data frame as follows:

             value     identifier
2007-01-01  0.781611      55
2007-01-01  0.766152      56
2007-01-01  0.766152      57
2007-02-01  0.705615      55
2007-02-01  0.032134      56
2007-02-01  0.032134      57
2008-01-01  0.026512      55
2008-01-01  0.993124      56
2008-01-01  0.993124      57
2008-02-01  0.226420      55
2008-02-01  0.033860      56
2008-02-01  0.033860      57

如何按标识符列中的值进行汇总，如下所示:

How can I aggregate by the value in the identifier column, like this:

           value  
2007-01-01  0.766  # (average of identifiers 55, 56 and 57 for this date)
2007-02-01  0.25   
2008-01-01  etc... 
2008-02-01

推荐答案

如果索引是日期时间，则可以访问.date属性，如果不是，则可以使用df.index = pd.to_datetime(df.index)对其进行转换，然后对日期并计算平均值:

If your index is a datetime then you can access the .date attribute, if not you can convert it using df.index = pd.to_datetime(df.index) and then perform a groupby on the date and calc the mean:

In [214]:

df.groupby(df.index.date)['value'].mean()
Out[214]:
2007-01-01    0.771305
2007-02-01    0.256628
2008-01-01    0.670920
2008-02-01    0.098047
Name: value, dtype: float64

这篇关于通过 pandas 数据框中的列中的重复值进行汇总的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！

查看全文

通过 pandas 数据框中的列中的重复值进行汇总 [英] Aggregate by repeated values in a column in a data frame in pandas

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

通过 pandas 数据框中的列中的重复值进行汇总 [英] Aggregate by repeated values in a column in a data frame in pandas

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭