Can I asynchronously prefetch to the GPU in tensorflow (1.4) using a two-variables approach?


Problem description


As far as I'm aware there is still no asynchronous prefetching of data between CPU/GPU in tensorflow 1.4.

https://github.com/tensorflow/tensorflow/issues/5722

I am trying to code this functionality myself as an exercise in understanding.

The following code attempts to implement this process:

  1. The next batch is stored in the variable data_var.
  2. Run some operations (i.e. train a model) that get their batch from the variable data_var; in this example, just execute myop, which depends on data_var.
  3. A prefetch OP assigns the next batch from the (CPU-based) Dataset object into temp_var, a GPU-based variable.
  4. Using tf.control_dependencies(...), wait for myop and prefetch_op to complete, then assign temp_var to data_var.

This doesn't appear to work. The TF profiler shows that myop is not processed asynchronously with the MEMCPYHtoD transfer, as was hoped.

I had expected the two OPs, myop and prefetch_op, to run asynchronously because there are no dependencies between them.

Here is the code I used to run this test. It will run stand-alone.

import tensorflow as tf
from tensorflow.python.client import timeline
import numpy as np
import os

sz = 2000
x = np.random.rand(sz, sz)

def gen():
  yield x

# Dataset
ds = tf.data.Dataset.from_generator(generator=gen, output_types=tf.float64)
ds = ds.repeat()
ds = ds.prefetch(2)
iterator = ds.make_one_shot_iterator()
next_element = iterator.get_next()

# Prefetch to GPU OPs - this is expected to happen asynchronously
temp_var = tf.Variable(np.zeros((sz, sz)), name='temp_var', dtype=tf.float64, expected_shape=(sz, sz), trainable=False)
data_var = tf.Variable(np.zeros((sz, sz)), name='data_var', dtype=tf.float64, expected_shape=(sz, sz), trainable=False)
prefetch_op = tf.assign(temp_var, next_element)

# Trivial math operation for timing purposes
myop = tf.sqrt(data_var, name='myop')

# Final prefetch to GPU operation, copy data from temp_var to data_var
with tf.control_dependencies((myop, prefetch_op)):
  assign_op = tf.assign(data_var, temp_var)

# Open session, initialize, and run 1 iteration to warm the prefetch buffer
sess = tf.Session()
sess.run(tf.global_variables_initializer())
sess.run((myop, assign_op))

# Main sess.run with profiling on
tf_options_profiler_on = tf.RunOptions(trace_level=tf.RunOptions.FULL_TRACE)
tf_run_metadata = tf.RunMetadata()
graph_result = sess.run((myop, assign_op), options=tf_options_profiler_on, run_metadata=tf_run_metadata)

# Write profile data
chrome_trace = timeline.Timeline(tf_run_metadata.step_stats).generate_chrome_trace_format()
os.makedirs('/tmp/profile', exist_ok=True)
with open('/tmp/profile/tf_profile_step.json', 'w') as f:
  f.write(chrome_trace)

print(graph_result)
print('Writing profiler output to /tmp/profile')

Solution

In tensorflow 1.7 the Dataset API now has prefetch_to_device.

Documentation:
https://www.tensorflow.org/versions/master/api_docs/python/tf/contrib/data/prefetch_to_device

Github discussion:
https://github.com/tensorflow/tensorflow/issues/13610#issuecomment-364331935
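
A minimal sketch of the question's pipeline rewritten with prefetch_to_device, assuming tensorflow 1.7; per the docs, prefetch_to_device must be the final transformation in the pipeline:

import tensorflow as tf
import numpy as np

sz = 2000
x = np.random.rand(sz, sz)

def gen():
  yield x

ds = tf.data.Dataset.from_generator(generator=gen, output_types=tf.float64)
ds = ds.repeat()
# prefetch_to_device stages elements into a GPU-side buffer in the
# background and must be the last transformation applied
ds = ds.apply(tf.contrib.data.prefetch_to_device('/gpu:0', buffer_size=2))
iterator = ds.make_one_shot_iterator()
next_element = iterator.get_next()  # tensor is already resident on the GPU

myop = tf.sqrt(next_element, name='myop')

sess = tf.Session()
print(sess.run(myop))

This makes the temp_var/data_var double-buffering unnecessary, since staging onto the GPU happens inside the dataset pipeline itself.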

Further down in the above Github discussion (now closed), another option called a MultiDeviceIterator is mentioned:
https://github.com/tensorflow/tensorflow/issues/13610#issuecomment-411893139
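
A rough sketch of the MultiDeviceIterator option, assuming a later 1.x release where it is exposed as tf.data.experimental.MultiDeviceIterator (it has also lived under tf.contrib.data); get_next() returns one element per requested device, already prefetched onto it:

import tensorflow as tf
import numpy as np

sz = 2000
x = np.random.rand(sz, sz)

def gen():
  yield x

ds = tf.data.Dataset.from_generator(generator=gen, output_types=tf.float64)
ds = ds.repeat()

# One prefetch buffer per device; here just a single GPU
mdi = tf.data.experimental.MultiDeviceIterator(ds, devices=['/gpu:0'])
next_element = mdi.get_next()[0]  # element prefetched onto /gpu:0

myop = tf.sqrt(next_element, name='myop')

sess = tf.Session()
sess.run(mdi.initializer)  # unlike a one-shot iterator, this must be initialized
print(sess.run(myop))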
