按两列分组，忽略对的顺序 [英] Groupby two columns ignoring order of pairs

查看：68 发布时间：2020/10/17 0:23:48 python pandas dataframe group-by pandas-groupby

本文介绍了按两列分组，忽略对的顺序的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

假设我们有一个数据帧，如下所示：

Suppose we have a dataframe that looks like this:

    start   stop   duration
0   A       B      1
1   B       A      2
2   C       D      2
3   D       C      0

什么是最好的构造以下列表的方法：i）开始/停止对； ii）开始/停止对数； iii）启动/停止对的平均持续时间？在这种情况下，顺序无关紧要：（A，B）=（B，A）。

What's the best way to construct a list of: i) start/stop pairs; ii) count of start/stop pairs; iii) avg duration of start/stop pairs? In this case, order should not matter: (A,B)=(B,A).

所需输出： [[开始，停止，计数，平均持续时间]]

在此示例中： [[A，B，2,1.5]，[C，D，2,1]]

推荐答案

sort 的前两列（您可以就地执行此操作，或者创建副本并执行相同的操作；我已经完成了前者），然后 groupby 和 agg ：

sort the first two columns (you can do this in-place, or create a copy and do the same thing; I've done the former), then groupby and agg:

df[['start', 'stop']] = np.sort(df[['start', 'stop']], axis=1)

(df.groupby(['start','stop'])
   .duration
   .agg(['count', 'mean'])
   .reset_index()
   .values
   .tolist())
# [['A', 'B', 2, 1.5], ['C', 'D', 2, 1.0]]

这篇关于按两列分组，忽略对的顺序的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！

查看全文

按两列分组，忽略对的顺序 [英] Groupby two columns ignoring order of pairs

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

按两列分组，忽略对的顺序 [英] Groupby two columns ignoring order of pairs

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭