腌 pandas 数据框的最快方法是什么? [英] What's the fastest way to pickle a pandas DataFrame?
本文介绍了腌 pandas 数据框的最快方法是什么?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!
问题描述
使用Pandas内置方法或pickle.dump
哪个更好?
Which is better, using Pandas built-in method or pickle.dump
?
标准的泡菜方法如下:
pickle.dump(my_dataframe, open('test_pickle.p', 'wb'))
Pandas内置方法如下:
The Pandas built-in method looks like this:
my_dataframe.to_pickle('test_pickle.p')
推荐答案
感谢@qwwqwwq,我发现pandas内置了用于数据帧的to_pickle
方法.我做了一个快速的时间测试:
Thanks to @qwwqwwq I discovered that pandas has a built-in to_pickle
method for dataframes. I did a quick time test:
In [1]: %timeit pickle.dump(df, open('test_pickle.p', 'wb'))
10 loops, best of 3: 91.8 ms per loop
In [2]: %timeit df.to_pickle('testpickle.p')
10 loops, best of 3: 88 ms per loop
因此,似乎内置函数仅略胜一筹(对我而言,这很有用,因为这意味着使用内置函数可能不值得重构代码)-希望这对某人有帮助!
So it seems that the built-in is only narrowly better (to me, this is useful because it means it's probably not worth refactoring code to use the built-in) - hope this helps someone!
这篇关于腌 pandas 数据框的最快方法是什么?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!
查看全文