First tf.session.run() performs dramatically different from later runs. Why?


Question

Here's an example to clarify what I mean:

First session.run():

[Screenshot: first run of a TensorFlow session]

Later session.run():

[Screenshot: later runs of a TensorFlow session (https://i.stack.imgur.com/cKjb1.png)]


I understand TensorFlow is doing some initialization here, but I'd like to know where in the source this manifests. This occurs on CPU as well as GPU, but the effect is more prominent on GPU. For example, in the case of an explicit Conv2D operation, the first run has a much larger number of Conv2D operations in the GPU stream. In fact, if I change the input size of the Conv2D, it can go from tens to hundreds of stream Conv2D operations. In later runs, however, there are always only five Conv2D operations in the GPU stream (regardless of input size). On CPU, the operation list in the first run is the same as in later runs, but we still see the same time discrepancy.


What portion of the TensorFlow source is responsible for this behavior? Where are GPU operations "split"?

Thanks for any help!

Answer


The tf.nn.conv2d() op takes much longer to run on the first tf.Session.run() invocation because, by default, TensorFlow uses cuDNN's autotune facility to choose how to run subsequent convolutions as fast as possible. You can see the autotune invocation here.
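The caching behavior can be illustrated outside TensorFlow with a minimal, self-contained sketch (plain Python, no TensorFlow; all names are illustrative, not real cuDNN or TensorFlow APIs): an expensive one-time algorithm search is cached per input size, so only the first run for a given size pays the cost. This also mirrors why changing the Conv2D input size re-triggers the slow first run.

```python
import time
from functools import lru_cache

# Toy model of cuDNN autotune (illustrative only): the "best algorithm"
# search is expensive, but its result is cached per input size, so only
# the first call for a given size pays the cost.

@lru_cache(maxsize=None)
def pick_algorithm(input_size):
    time.sleep(0.05)  # stand-in for benchmarking many candidate kernels
    return "fastest_algorithm_for_%d" % input_size

def run_conv(input_size):
    pick_algorithm(input_size)  # cached after the first call per size
    # ... the actual convolution would execute here ...

def timed(fn, *args):
    start = time.perf_counter()
    fn(*args)
    return time.perf_counter() - start

first = timed(run_conv, 224)    # includes the one-time "autotune" cost
later = timed(run_conv, 224)    # cache hit: fast
resized = timed(run_conv, 512)  # a new input size re-triggers the search
print("first=%.4fs later=%.6fs resized=%.4fs" % (first, later, resized))
```

In the real system the caching happens inside cuDNN/TensorFlow rather than in user code, but the timing profile (slow first run, fast repeats, slow again after a shape change) is the same.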


There is an undocumented environment variable that you can use to disable autotune. Set TF_CUDNN_USE_AUTOTUNE=0 when you start the process running TensorFlow (e.g. the Python interpreter) to disable its use.
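A minimal sketch of setting this from inside Python instead of the shell: the variable must be in the process environment before TensorFlow initializes, so it has to be set before the import (shown commented here so the sketch stands alone).

```python
import os

# Must be set before TensorFlow is imported, because the variable is read
# when the process initializes its cuDNN support.
os.environ["TF_CUDNN_USE_AUTOTUNE"] = "0"

# import tensorflow as tf  # import TensorFlow only after the variable is set
```

Equivalently, launch the process with the variable already set, e.g. `TF_CUDNN_USE_AUTOTUNE=0 python your_script.py` (script name hypothetical).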
