Why do nvprof and nvidia-smi report different results on power?


Question

I used nvprof and nvidia-smi to monitor the GPU power dissipation respectively, but observed different results, summarized in the table below.

----------------------------------------------------------------
gpu     |             busy           |             idle         
model   |  nvprof[Watt]  smi[Watt]   |  nvprof[Watt]  smi[Watt] 
----------------------------------------------------------------
M2090   |   ~151           ~151      |     ~100          ~75
K20     |   ~105           ~102      |     ~63           ~43
----------------------------------------------------------------

note 0: "busy" means my code is running on the monitored GPU

note 1: nvprof reports the power for all the devices. So my way to get the "idle" power using nvprof for a specific GPU is simply to have the code running on another GPU.

note 2: nvidia-smi reports a couple of different quantities about power, but I was focusing on "power draw".

note 3: CUDA version: 5.5
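
For what it's worth, the per-GPU "power draw" and p-state that nvidia-smi reports (notes 1 and 2) can also be read programmatically through NVML, the library nvidia-smi itself is built on. Below is a minimal sketch of such a query; the pynvml Python bindings and GPU index 0 are assumptions here:

    # Read the same "power draw" and performance state that nvidia-smi reports,
    # but for one specific GPU, via NVML. pynvml and the GPU index are assumptions.
    import pynvml

    pynvml.nvmlInit()
    try:
        handle = pynvml.nvmlDeviceGetHandleByIndex(0)              # GPU to monitor
        power_mw = pynvml.nvmlDeviceGetPowerUsage(handle)          # milliwatts
        pstate = pynvml.nvmlDeviceGetPerformanceState(handle)      # 0 = P0, ..., 15 = P15
        print("power draw: %.1f W, p-state: P%d" % (power_mw / 1000.0, pstate))
    finally:
        pynvml.nvmlShutdown()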

So my question is: why is the power reported by nvidia-smi generally smaller than that reported by nvprof, and why does this discrepancy become larger when the idle power is monitored? And ultimately, which utility should I trust more?

Also, just to make sure: the power that the two utilities measure refers to the input electrical power (P = I * U) rather than the output thermal power, right?

Thanks a lot for any advice!

Update

@njuffa's and @talonmies's speculation makes very good sense. So I explored smi a little bit more for power analysis. The results, however, do not make sense to me.

Additional notes:


1. The discontinuity of the red data is because I directly used the timestamp reported by smi, which has low resolution (seconds). Besides, for illustration purposes p0 is assigned a numerical value of 20 and p1 a value of 10. So for most of the time the GPU is put into its full performance state (this is odd), except for the "busy" case, where the GPU somehow drops to p1 during 15~18 s (odd).

2. It is not until ~21.3 s that cudaSetDevice() is invoked for the very first time. So the power rise and p-state change that occur at ~18 s are rather odd.

3. "busy power" is measured with my GPU code running in the background and smi put into an infinite loop that queries the power and p-state repeatedly until the background process terminates. "idle power" is measured simply by launching smi 50 times. Apparently in the latter case smi exhibits larger overhead, which is, again, odd.


Answer

Ignore the p-states. They are confusing you.

nvprof (alone) uses substantially more of the GPU than does nvidia-smi (alone). So the "idle" power consumed when running nvprof is higher than it is when just doing nvidia-smi. nvprof fires up a number of engines on the GPU, whereas nvidia-smi simply fires up some registers and maybe some I2C circuitry.
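
One way to illustrate this point (a rough sketch, not a reproduction of what nvprof actually does internally) is to read the NVML power before and after forcing CUDA context creation with cudaFree(0); merely holding a CUDA context open typically changes the "idle" reading. pynvml, the unversioned libcudart.so name, and matching NVML/CUDA device ordering are assumptions here:

    # Compare NVML power readings with and without a live CUDA context.
    # Illustrative only; library name and device ordering are assumptions.
    import ctypes
    import time
    import pynvml

    pynvml.nvmlInit()
    handle = pynvml.nvmlDeviceGetHandleByIndex(0)

    def watts():
        return pynvml.nvmlDeviceGetPowerUsage(handle) / 1000.0

    print("no CUDA context:   %.1f W" % watts())

    # cudaFree(0) is a common idiom to force CUDA context creation without
    # doing any real work on the device.
    cudart = ctypes.CDLL("libcudart.so")
    cudart.cudaFree(ctypes.c_void_p(0))

    time.sleep(1.0)  # let the reading settle
    print("with CUDA context: %.1f W" % watts())

    pynvml.nvmlShutdown()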

The GPU has a number of p-states, and a true idle p-state is P8 or below (i.e. a numerically larger P-number).

Just running nvidia-smi can (and frequently will) briefly raise the p-state of the GPU from a "true idle" p-state to a higher one, like P0. This does not tell you:

- how long the p-state elevation lasts (the sampling period of nvidia-smi is too coarse), or
- how much power is actually being consumed.

Yes, the p-state is an indicator, but it does not tell you anything in a calibrated way. A GPU can be more or less "idle" while at P0 (for instance, put your GPUs in persistence mode).
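
One way to check how long such a p-state elevation lasts is to poll NVML directly at a finer interval than the second-resolution timestamps mentioned in the update, keeping a single NVML session instead of relaunching nvidia-smi per sample. This is only a sketch; pynvml, GPU index 0, and the 0.1 s sampling interval are assumptions:

    # Poll power draw [W] and p-state for one GPU at a fixed interval, using a
    # single NVML session rather than repeated nvidia-smi invocations.
    import time
    import pynvml

    def sample_power(gpu_index=0, interval_s=0.1, duration_s=30.0):
        pynvml.nvmlInit()
        samples = []
        try:
            handle = pynvml.nvmlDeviceGetHandleByIndex(gpu_index)
            t0 = time.time()
            while time.time() - t0 < duration_s:
                t = time.time() - t0
                watts = pynvml.nvmlDeviceGetPowerUsage(handle) / 1000.0
                pstate = pynvml.nvmlDeviceGetPerformanceState(handle)
                samples.append((t, watts, pstate))
                time.sleep(interval_s)
        finally:
            pynvml.nvmlShutdown()
        return samples

    if __name__ == "__main__":
        for t, w, p in sample_power():
            print("%7.2f s  %6.1f W  P%d" % (t, w, p))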

The discrepancy between the two measurements has already been explained. The graph and the additional update are not serving any useful purpose; they are just confusing you.

If you want to measure power, use either approach. It's clear that the two are quite correlated in the GPU "busy" case, and the fact that they appear to differ in the "idle" case simply means you're making assumptions about "idle" in the two cases which simply aren't true.

