Getting good mixing with many input datafiles in tensorflow


Question


I'm working with tensorflow, hoping to train a deep CNN to do move prediction for the game Go. The dataset I created consists of 100,000 binary data files, where each datafile corresponds to a recorded game and contains roughly 200 training samples (one for each move in the game). I believe it will be very important to get good mixing when using SGD. I'd like my batches to contain samples from different games AND samples from different stages of the games. So, for example, simply reading one sample from the start of 100 files and shuffling isn't good, because those 100 samples will all be the first move of each game.


I have read the tutorial on feeding data from files but I'm not sure if their provided libraries do what I need. If I were to hard code it myself I would basically initialize a bunch of file pointers to random locations within each file and then pull samples from random files, incrementing the file pointers accordingly.
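The hand-rolled approach described above can be sketched in a few lines of plain Python. Record size, file paths, and the `make_batch` helper are all hypothetical names for illustration, assuming fixed-length binary records:

```python
# A minimal pure-Python sketch of the hand-rolled idea: keep a read
# offset into each file and pull samples from randomly chosen files,
# advancing the touched file's offset (wrapping at end of file).
import os
import random

def make_batch(paths, record_bytes, batch_size, offsets, rng):
    """Draw one batch of raw records, mutating `offsets` in place."""
    batch = []
    for _ in range(batch_size):
        i = rng.randrange(len(paths))          # pick a random file
        size = os.path.getsize(paths[i])
        with open(paths[i], "rb") as f:
            f.seek(offsets[i])                 # resume where we left off
            batch.append(f.read(record_bytes))
        offsets[i] = (offsets[i] + record_bytes) % size  # wrap around
    return batch
```

The initial offsets would be chosen at random record boundaries, e.g. `offsets = [rng.randrange(os.path.getsize(p) // record_bytes) * record_bytes for p in paths]`, so the first pass doesn't start every file at move one.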


So, my question is: does tensorflow provide this sort of functionality, or would it be easier to write my own code for creating batches?

Answer


Yes - what you want is to use a combination of two things.


First, randomly shuffle the order in which you input your datafiles, by reading from them using a tf.train.string_input_producer with shuffle=True that feeds into whatever input method you use (if you can put your examples into tf.Example proto format, that's easy to use with parse_example). To be very clear, you put the list of filenames in the string_input_producer and then read them with another method such as read_file, etc.
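A minimal sketch of this first stage. These queue-based APIs are from TensorFlow 1.x (removed in the 2.x namespace), so the snippet goes through `tf.compat.v1`; the file pattern is a hypothetical placeholder:

```python
# Stage 1: shuffle the order in which files are visited.
import tensorflow.compat.v1 as tf

tf.disable_eager_execution()  # queue-based input requires graph mode

filenames = ["games/g1.bin", "games/g2.bin"]  # hypothetical paths

# shuffle=True reshuffles the filename order on every epoch, so the
# files are visited in a different random order on each pass.
filename_queue = tf.train.string_input_producer(filenames, shuffle=True)

# A reader dequeues one filename at a time and yields (key, contents).
reader = tf.WholeFileReader()
key, value = reader.read(filename_queue)
```

`WholeFileReader` is shown for simplicity; for files containing many fixed-size samples, a `FixedLengthRecordReader` (or `TFRecordReader` with `tf.Example` protos) emits one record at a time instead of the whole file.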


Second, you need to mix at a finer granularity. You can accomplish this by feeding the input examples into a tf.train.shuffle_batch node with a large capacity and large value of min_after_dequeue. One particularly nice way is to use a shuffle_batch_join that receives input from multiple files, so that you get a lot of mixing. Set the capacity of the batch big enough to mix well without exhausting your RAM. Tens of thousands of examples usually works pretty well.
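A sketch of this second stage, again via `tf.compat.v1`. The record size, reader count, decoding step, and buffer sizes are illustrative assumptions, not values from the answer:

```python
# Stage 2: several readers pull records from the shuffled filename
# queue in parallel, and shuffle_batch_join mixes their outputs
# through one large shuffling buffer.
import tensorflow.compat.v1 as tf

tf.disable_eager_execution()

RECORD_BYTES = 1024  # hypothetical size of one serialized sample
filenames = ["games/g1.bin", "games/g2.bin"]  # hypothetical paths
filename_queue = tf.train.string_input_producer(filenames, shuffle=True)

num_readers = 4  # more readers -> samples from more files in flight
example_lists = []
for _ in range(num_readers):
    reader = tf.FixedLengthRecordReader(record_bytes=RECORD_BYTES)
    _, record = reader.read(filename_queue)
    example = tf.decode_raw(record, tf.uint8)  # stand-in for real parsing
    example.set_shape([RECORD_BYTES])
    example_lists.append([example])

min_after_dequeue = 20000  # "tens of thousands", per the answer
batch = tf.train.shuffle_batch_join(
    example_lists,
    batch_size=128,
    capacity=min_after_dequeue + 3 * 128,
    min_after_dequeue=min_after_dequeue)
```

`min_after_dequeue` is what controls mixing quality: a dequeued batch is sampled from at least that many buffered examples, so the larger it is (within RAM limits), the better the shuffle.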


Keep in mind that the batch functions add a QueueRunner to the QUEUE_RUNNERS collection, so you need to run tf.train.start_queue_runners() in your session before pulling any data, or the pipeline will block forever.
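Putting the pieces together, here is a hypothetical end-to-end sketch (via `tf.compat.v1`): it writes a few fake fixed-length "game" files, builds the pipeline with a single reader for brevity, starts the queue runners, and pulls one shuffled batch. All sizes and names are illustrative:

```python
import os
import tempfile

import numpy as np
import tensorflow.compat.v1 as tf

tf.disable_eager_execution()

# Fake data: four files, 50 fixed-length records each.
RECORD_BYTES = 16
tmpdir = tempfile.mkdtemp()
for i in range(4):
    data = np.full((50, RECORD_BYTES), i, dtype=np.uint8)
    with open(os.path.join(tmpdir, "game%d.bin" % i), "wb") as f:
        f.write(data.tobytes())

# Pipeline: shuffled filenames -> record reader -> shuffling batcher.
filenames = tf.gfile.Glob(os.path.join(tmpdir, "*.bin"))
filename_queue = tf.train.string_input_producer(filenames, shuffle=True)
reader = tf.FixedLengthRecordReader(record_bytes=RECORD_BYTES)
_, record = reader.read(filename_queue)
example = tf.decode_raw(record, tf.uint8)
example.set_shape([RECORD_BYTES])
batch = tf.train.shuffle_batch(
    [example], batch_size=32, capacity=500, min_after_dequeue=100)

with tf.Session() as sess:
    sess.run(tf.local_variables_initializer())
    coord = tf.train.Coordinator()
    # Without this call, the queues never fill and sess.run(batch) hangs.
    threads = tf.train.start_queue_runners(sess=sess, coord=coord)
    first_batch = sess.run(batch)  # one (32, RECORD_BYTES) array
    coord.request_stop()
    coord.join(threads)
```

In a real training loop the `sess.run(batch)` would be replaced by running the train op, and the Coordinator's stop/join would sit in a `finally:` block so reader threads shut down cleanly on error.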

