Find repeated words in a column and sort by number of occurrences using pandas

Problem description

        A                             B
 1) Italy                   Transport for London.....
 2) Italy                   Roseanne Barr Actor leavin..... 
 3) America                 Americas Transport for London........ 
 4) America                 Transport for London.....
 5) America                 Roseanne Barr Actor leavin..... 
 6) France                  Americas Transport for London........ 

This is a sample of my Excel sheet. I want it sorted by the number of occurrences of each value in column A. For example, America appears 3 times in the column, so the rows with America should move to the top, followed by the rows with Italy. The rows with France should come last because France has the lowest number of occurrences in column A.

Recommended answer

Use transform with argsort in descending order to get the row positions, then select the rows with iloc:

# Negate the group sizes so the most frequent values in A come first
df = df.iloc[(-df.groupby('A')['A'].transform('size')).argsort()]
print(df)
          A                              B
3)  America  Americas Transport for London
4)  America           Transport for London
5)  America     Roseanne Barr Actor leavin
1)    Italy           Transport for London
2)    Italy     Roseanne Barr Actor leavin
6)   France  Americas Transport for London
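
The one-liner above can be tried end to end with a small reconstruction of the sample sheet. This is only a sketch: the DataFrame contents and the index labels are retyped from the question, the names counts, order and result are illustrative, and the stable sort kind is an added assumption (not part of the original answer) so that tied rows keep their original relative order.

```python
import pandas as pd

# Sample data retyped from the question (truncated text in column B kept as-is).
df = pd.DataFrame(
    {
        "A": ["Italy", "Italy", "America", "America", "America", "France"],
        "B": [
            "Transport for London",
            "Roseanne Barr Actor leavin",
            "Americas Transport for London",
            "Transport for London",
            "Roseanne Barr Actor leavin",
            "Americas Transport for London",
        ],
    },
    index=["1)", "2)", "3)", "4)", "5)", "6)"],
)

# For every row, the number of rows sharing its value in column A.
counts = df.groupby("A")["A"].transform("size")

# Negate so larger groups sort first; kind="stable" (an assumption added here)
# keeps rows with equal counts in their original order.
order = (-counts).to_numpy().argsort(kind="stable")

# Reorder the rows by position.
result = df.iloc[order]
print(result)
```

Converting to a NumPy array before calling argsort means iloc receives plain integer positions rather than a Series.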

Or create a new column and sort by it:

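# Helper column holding how many times each row's A value occurs
df['new'] = df.groupby('A')['A'].transform('size')

# Sort by that count, most frequent first
df = df.sort_values('new', ascending=False)
print(df)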
          A                              B  new
3)  America  Americas Transport for London    3
4)  America           Transport for London    3
5)  America     Roseanne Barr Actor leavin    3
1)    Italy           Transport for London    2
2)    Italy     Roseanne Barr Actor leavin    2
6)   France  Americas Transport for London    1
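
If the helper column is not wanted in the result, one possible variation (not from the original answer) is to compute the counts inside sort_values through its key argument. The sketch below assumes pandas 1.1 or later, where key is available, and again retypes the sample data from the question. The name result is illustrative.

```python
import pandas as pd

# Sample data retyped from the question (truncated text in column B kept as-is).
df = pd.DataFrame(
    {
        "A": ["Italy", "Italy", "America", "America", "America", "France"],
        "B": [
            "Transport for London",
            "Roseanne Barr Actor leavin",
            "Americas Transport for London",
            "Transport for London",
            "Roseanne Barr Actor leavin",
            "Americas Transport for London",
        ],
    }
)

# The key function replaces each value in A with its frequency, and the sort
# uses those frequencies instead of the strings themselves.
# kind="stable" is requested so that tied rows are not shuffled.
result = df.sort_values(
    "A",
    key=lambda col: col.map(col.value_counts()),
    ascending=False,
    kind="stable",
)
print(result)
```

The frame keeps only its original columns A and B, so no cleanup step is needed afterwards.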
