腌 pandas 数据框的最快方法是什么? [英] What's the fastest way to pickle a pandas DataFrame?

查看:96
本文介绍了腌 pandas 数据框的最快方法是什么?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

使用Pandas内置方法或pickle.dump哪个更好?

Which is better, using Pandas built-in method or pickle.dump?

标准的泡菜方法如下:

pickle.dump(my_dataframe, open('test_pickle.p', 'wb'))

Pandas内置方法如下:

The Pandas built-in method looks like this:

my_dataframe.to_pickle('test_pickle.p')

推荐答案

感谢@qwwqwwq,我发现pandas内置了用于数据帧的to_pickle方法.我做了一个快速的时间测试:

Thanks to @qwwqwwq I discovered that pandas has a built-in to_pickle method for dataframes. I did a quick time test:

In [1]: %timeit pickle.dump(df, open('test_pickle.p', 'wb'))
10 loops, best of 3: 91.8 ms per loop

In [2]: %timeit df.to_pickle('testpickle.p')
10 loops, best of 3: 88 ms per loop

因此,似乎内置函数仅略胜一筹(对我而言,这很有用,因为这意味着使用内置函数可能不值得重构代码)-希望这对某人有帮助!

So it seems that the built-in is only narrowly better (to me, this is useful because it means it's probably not worth refactoring code to use the built-in) - hope this helps someone!

这篇关于腌 pandas 数据框的最快方法是什么?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆