使用Pandas创建虚拟变量时Jupyter Notebook内核死亡 [英] Jupyter notebook kernel dies when creating dummy variables with pandas

查看：320 发布时间：2020/4/25 6:50:16 python pandas ipython-notebook azure-machine-learning-studio

本文介绍了使用Pandas创建虚拟变量时Jupyter Notebook内核死亡的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在从事沃尔玛Kaggle竞赛，并且正在尝试创建"FinelineNumber"列的虚拟列.对于上下文，df.shape返回(647054, 7).我正在尝试为df['FinelineNumber']创建一个虚拟列，该列具有5,196个唯一值.结果应该是形状为(647054, 5196)的数据框，然后我计划将其concat设置为原始数据框.

I am working on the Walmart Kaggle competition and I'm trying to create a dummy column of of the "FinelineNumber" column. For context, df.shape returns (647054, 7). I am trying to make a dummy column for df['FinelineNumber'], which has 5,196 unique values. The results should be a dataframe of shape (647054, 5196), which I then plan to concat to the original dataframe.

几乎每次我运行fineline_dummies = pd.get_dummies(df['FinelineNumber'], prefix='fl')时，都会收到以下错误消息The kernel appears to have died. It will restart automatically.我在具有16GB RAM的MacBookPro上的jupyter笔记本中运行python 2.7.

Nearly every time I run fineline_dummies = pd.get_dummies(df['FinelineNumber'], prefix='fl'), I get the following error message The kernel appears to have died. It will restart automatically. I am running python 2.7 in jupyter notebook on a MacBookPro with 16GB RAM.

有人可以解释为什么会发生这种情况(为什么它在大多数情况下都会发生，但并非每次都发生)?它是Jupyter笔记本还是熊猫虫?另外，我认为这可能与内存不足有关，但是在具有> 100 GB RAM的Microsoft Azure机器学习笔记本上出现了相同的错误.在Azure ML上，内核每次都几乎立即死亡.

Can someone explain why this is happening (and why it happens most of the time but not every time)? Is it a jupyter notebook or pandas bug? Also, I thought it might have to do with not enough RAM but I get the same error on a Microsoft Azure Machine Learning notebook with >100 GB of RAM. On Azure ML, the kernel dies every time - almost immediately.

使用Pandas创建虚拟变量时Jupyter Notebook内核死亡 [英] Jupyter notebook kernel dies when creating dummy variables with pandas

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

使用Pandas创建虚拟变量时Jupyter Notebook内核死亡 [英] Jupyter notebook kernel dies when creating dummy variables with pandas

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭