best way to preserve numpy arrays on disk


Problem description

I am looking for a fast way to preserve large numpy arrays. I want to save them to disk in a binary format, then read them back into memory relatively quickly. cPickle is not fast enough, unfortunately.

I found numpy.savez and numpy.load. But the weird thing is, numpy.load loads the saved file lazily, as a kind of "memory map". That means regular manipulation of the arrays is really slow. For example, something like this would be really slow:

#!/usr/bin/python
import time
from tempfile import TemporaryFile

import numpy as np

n = 10000000

a = np.arange(n)
b = np.arange(n) * 10
c = np.arange(n) * -0.5

# Write all three arrays into a single .npz archive backed by a temporary file.
outfile = TemporaryFile()
np.savez(outfile, a=a, b=b, c=c)

outfile.seek(0)
t = time.time()
z = np.load(outfile)
print("loading time = ", time.time() - t)

# Pulling the arrays out of the lazily loaded archive is where the real work happens.
t = time.time()
aa = z['a']
bb = z['b']
cc = z['c']
print("assigning time = ", time.time() - t)

More precisely, the first line is really fast, but the remaining lines, which assign the arrays to objects, are ridiculously slow:

loading time =  0.000220775604248
assigning time =  2.72940087318

Is there a better way of preserving numpy arrays? Ideally, I want to be able to store multiple arrays in one file.

Recommended answer

I'm a big fan of hdf5 for storing large numpy arrays. There are two options for dealing with hdf5 in Python:

http://www.pytables.org/

http://www.h5py.org/

Both are designed to work efficiently with numpy arrays.
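For example, here is a minimal sketch of writing several arrays to one HDF5 file with h5py and reading them back; the file name data.h5 and the dataset names are arbitrary choices for illustration, not part of the original answer.

import numpy as np
import h5py

a = np.arange(10)
b = np.arange(10) * 10

# Write both arrays into a single HDF5 file; each one becomes a named dataset.
with h5py.File('data.h5', 'w') as f:
    f.create_dataset('a', data=a)
    f.create_dataset('b', data=b)

# Read them back; the [:] slice pulls the full dataset into memory as a numpy array.
with h5py.File('data.h5', 'r') as f:
    aa = f['a'][:]
    bb = f['b'][:]

print("a round-trips correctly:", np.array_equal(a, aa))

PyTables works with the same underlying HDF5 format, just through a higher-level, more database-like API.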
