How to load data in parallel in TensorFlow?

Problem description

First, some background on my application:

I have about 500,000 videos saved as .avi files on disk, and I will use them as training samples. The easiest way to use them would be to load them all into memory at once and then feed each batch into the model for training. However, my memory is not big enough to hold the whole dataset, so I need to load the video data in batches. As you know, decoding a batch of videos (say 64) can take a lot of time, and if this is done serially, we waste a lot of time on data loading instead of computing. Therefore, I want to load the batches in parallel, much like the fit_generator API in Keras. I wonder whether there is an existing way to do this in TensorFlow.

Thanks for any suggestions :)

PS: I used to implement this idea with the threading package in Python; for more, see https://github.com/FesianXu/Parallel-DataLoader-in-TensorFlow

Of course, it is just toy code and too ad hoc. I want a more general solution, just like fit_generator in Keras.

Recommended answer

Take a look at tf.data.Dataset.from_generator:

Creates a Dataset whose elements are generated by generator.

The generator argument must be a callable object that returns an object supporting the iter() protocol (e.g. a generator function). The elements generated by generator must be compatible with the given output_types and (optional) output_shapes arguments.

This example shows how to easily parallelize the generator using tf.data.Dataset.map with the num_parallel_calls parameter: https://github.com/tensorflow/tensorflow/issues/14448#issuecomment-349240274
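To make the pattern concrete, here is a minimal sketch of that approach: the generator yields only lightweight file paths, and the expensive per-video decoding is pushed into Dataset.map so it can run in parallel threads. The file names, frame shape, and decode_video function below are placeholders (a real decoder would use something like OpenCV), not part of the original answer:

```python
import numpy as np
import tensorflow as tf

# Placeholder list of video paths; in practice, glob the .avi files on disk.
video_paths = [f"video_{i}.avi" for i in range(8)]

def path_generator():
    # Yield lightweight items (paths); heavy decoding happens later in map().
    for p in video_paths:
        yield p

def decode_video(path):
    # Stand-in for a real .avi decoder; returns fake frames of a fixed shape.
    return np.zeros((16, 64, 64, 3), dtype=np.float32)

def tf_decode(path):
    # Wrap the Python decoder so tf.data can invoke it from parallel threads.
    frames = tf.numpy_function(decode_video, [path], tf.float32)
    frames.set_shape((16, 64, 64, 3))
    return frames

dataset = (
    tf.data.Dataset.from_generator(
        path_generator,
        output_signature=tf.TensorSpec(shape=(), dtype=tf.string))
    .map(tf_decode, num_parallel_calls=tf.data.AUTOTUNE)  # parallel decode
    .batch(4)
    .prefetch(tf.data.AUTOTUNE)  # overlap data loading with training
)

for batch in dataset:
    print(batch.shape)  # (4, 16, 64, 64, 3)
```

The prefetch at the end is what gives the fit_generator-like behavior the question asks for: the next batch is decoded while the current one is being consumed by the model.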

More info: https://www.tensorflow.org/guide/data_performance#parallelizing_data_extraction
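That guide also covers parallelizing the extraction step itself with Dataset.interleave. A minimal sketch of the idea (the per-file dataset below is illustrative, standing in for reading one file from disk):

```python
import tensorflow as tf

def make_file_dataset(file_id):
    # Stand-in for reading one file; yields three dummy samples per "file".
    return tf.data.Dataset.from_tensors(file_id).repeat(3)

file_ids = tf.data.Dataset.range(4)
dataset = file_ids.interleave(
    make_file_dataset,
    cycle_length=2,                       # read 2 files concurrently
    num_parallel_calls=tf.data.AUTOTUNE)  # parallel extraction

# Every file's samples are present; only their interleaving order varies.
print(sorted(int(x) for x in dataset))  # [0, 0, 0, 1, 1, 1, 2, 2, 2, 3, 3, 3]
```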
