数据集不适合内存 [英] Dataset does not fit in memory

查看：71 发布时间：2021/6/13 19:30:03 memory-management tensorflow out-of-memory tflearn

本文介绍了数据集不适合内存的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我有一个 MNIST 之类的数据集，它不适合内存(进程内存，非 GPU 内存).我的数据集是 4GB.

I have an MNIST like dataset that does not fit in memory, (process memory, not gpu memory). My dataset is 4GB.

这不是 TFLearn 问题.

据我所知，model.fit 需要 x 和 y 的 array.

As far as I know model.fit requires an array for x and y.

TFLearn 示例:

TFLearn example:

model.fit(x, y, n_epoch=10, validation_set=(val_x, val_y))

我想知道有没有一种方法可以传递批处理迭代器"而不是数组.基本上，对于每个批次，我都会从磁盘加载必要的数据.

I was wondering is there's a way where we can pass a "batch iterator", instead of an array. Basically for each batch I would load the necessary data from disk.

这样我就不会遇到进程内存溢出错误.

This way I would not run into process memory overflow errors.

编辑np.memmap 可能是一个选项.但我不知道如何跳过组成标题的前几个字节.

EDIT np.memmap could be an option. But I don't see how to skip the first few bytes that compose the header.

数据集不适合内存 [英] Dataset does not fit in memory

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

数据集不适合内存 [英] Dataset does not fit in memory

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭