pandas.hashtable.PyObjectHashTable.get_item中的Python pandas groupby键错误 [英] Python pandas groupby key error in pandas.hashtable.PyObjectHashTable.get_item

查看:82
本文介绍了pandas.hashtable.PyObjectHashTable.get_item中的Python pandas groupby键错误的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我正在做一个看似简单的Pandas小组成员.该列是没有NaN或奇怪字符串的字符串列.但是,我不断收到以下错误.有谁知道为什么会这样吗?我觉得这可能与我的数据有关,但是一切似乎都没问题...

I am doing what seems to be a simple group by in Pandas. The column is a string column with no NaN's or weird strings. However, I keep getting the below error. Does anyone know why this mights happen? I feel like it may have something to do with my data, but it all seems to be ok...

我正在运行by_user = df.groupby('User')

和堆栈跟踪:

by_user = df.groupby('User')
File "c:\Anaconda\lib\site-packages\pandas\core\generic.py", line 2773, in groupby
sort=sort, group_keys=group_keys, squeeze=squeeze)
File "c:\Anaconda\lib\site-packages\pandas\core\groupby.py", line 1142, in groupby
return klass(obj, by, **kwds)
File "c:\Anaconda\lib\site-packages\pandas\core\groupby.py", line 388, in __init__ level=level, sort=sort)
File "c:\Anaconda\lib\site-packages\pandas\core\groupby.py", line 2041, in _get_grouper
gpr = obj[gpr]
File "c:\Anaconda\lib\site-packages\pandas\core\frame.py", line 1678, in __getitem__
return self._getitem_column(key)
File "c:\Anaconda\lib\site-packages\pandas\core\frame.py", line 1685, in _get      item_column
return self._get_item_cache(key)
File "c:\Anaconda\lib\site-packages\pandas\core\generic.py", line 1052, in _ge
t_item_cache
values = self._data.get(item)
File "c:\Anaconda\lib\site-packages\pandas\core\internals.py", line 2565, in get
loc = self.items.get_loc(item)
File "c:\Anaconda\lib\site-packages\pandas\core\index.py", line 1181, in get_loc
return self._engine.get_loc(_values_from_object(key))
File "index.pyx", line 129, in pandas.index.IndexEngine.get_loc (pandas\index.
c:3656)
File "index.pyx", line 149, in pandas.index.IndexEngine.get_loc (pandas\index.
c:3534)
File "hashtable.pyx", line 696, in pandas.hashtable.PyObjectHashTable.get_item
(pandas\hashtable.c:11911)
File "hashtable.pyx", line 704, in pandas.hashtable.PyObjectHashTable.get_item
(pandas\hashtable.c:11864)
KeyError: 'User'

df.info():

User Code        175167 non-null object
Version          175167 non-null object
Date Accessed    175167 non-null datetime64[ns]
Series           175167 non-null object
Software         175167 non-null object
User             175167 non-null object

推荐答案

[已从评论中移出]

[moved from comments]

很容易错过列名中的结尾空格,但是您可以手动检查df.columns:

It's easy to miss trailing whitespace in column names, but you can check df.columns manually:

>>> df = pd.DataFrame({"User": [1,2]})
>>> df2 = pd.DataFrame({"User ": [1,2]})
>>> df
   User
0     1
1     2
>>> df2
   User 
0      1
1      2
>>> df.columns
Index([u'User'], dtype='object')
>>> df2.columns
Index([u'User '], dtype='object')

(为了稍微拉开窗帘,我怀疑这样的事情可能会发生,因为当我模拟自己的DataFrame并查看df.info()时,我没有看到列名和数字似乎显示在您的输出中.)

(To peel back the curtain a bit, I suspected something like this might be going on because when I mocked up my own DataFrame and looked at df.info(), I didn't see as much space between the column names and the numbers as your output seemed to show.)

这篇关于pandas.hashtable.PyObjectHashTable.get_item中的Python pandas groupby键错误的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆