joblib 与 pickle 的不同用例是什么? [英] What are the different use cases of joblib versus pickle?

查看：30 发布时间：2021/12/25 14:21:45 python pickle scikit-learn

本文介绍了joblib 与 pickle 的不同用例是什么?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

背景:我刚刚开始使用 scikit-learn，并在页面底部阅读关于 joblib 对比泡菜.

Background: I'm just getting started with scikit-learn, and read at the bottom of the page about joblib, versus pickle.

用joblib代替pickle可能更有意思(joblib.dump & joblib.load)，在大数据上效率更高，但是只能pickle到磁盘，不能到字符串

it may be more interesting to use joblib’s replacement of pickle (joblib.dump & joblib.load), which is more efficient on big data, but can only pickle to the disk and not to a string

我在 Pickle 上阅读了这个问答，Python 中 pickle 的常见用例不知道这里的社区是否可以分享joblib 和pickle 之间的区别?什么时候应该使用一个?

I read this Q&A on Pickle, Common use-cases for pickle in Python and wonder if the community here can share the differences between joblib and pickle? When should one use one over another?

推荐答案

joblib is usually significantly faster on large numpy arrays because it has a special handling for the array buffers of the numpy datastructure. To find about the implementation details you can have a look at the source code. It can also compress that data on the fly while pickling using zlib or lz4.
joblib also makes it possible to memory map the data buffer of an uncompressed joblib-pickled numpy array when loading it which makes it possible to share memory between processes.
if you don't pickle large numpy arrays, then regular pickle can be significantly faster, especially on large collections of small python objects (e.g. a large dict of str objects) because the pickle module of the standard library is implemented in C while joblib is pure python.
since PEP 574 (Pickle protocol 5) has been merged in Python 3.8, it is now much more efficient (memory-wise and cpu-wise) to pickle large numpy arrays using the standard library. Large arrays in this context means 4GB or more.
But joblib can still be useful with Python 3.8 to load objects that have nested numpy arrays in memory mapped mode with mmap_mode="r".

这篇关于joblib 与 pickle 的不同用例是什么?的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！

查看全文

joblib 与 pickle 的不同用例是什么? [英] What are the different use cases of joblib versus pickle?

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

joblib 与 pickle 的不同用例是什么? [英] What are the different use cases of joblib versus pickle?

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭