Tensorflow CUDA-CUPTI错误:无法加载CUPTI或找不到符号 [英] Tensorflow CUDA - CUPTI error: CUPTI could not be loaded or symbol could not be found
问题描述
我使用Tensorflow v 1.14.0.我在Windows 10上工作.这是环境变量在PATH
中的外观:
I use the Tensorflow v 1.14.0. I work on Windows 10. And here is how relevant environment variables look in the PATH
:
C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.0\bin
C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.0\libnvvp
C:\Program Files (x86)\NVIDIA Corporation\PhysX\Common
C:\Users\sinthes\AppData\Local\Programs\Python\Python37
C:\Users\sinthes\AppData\Local\Programs\Python\Python37\Scripts
C:\Program Files\NVIDIA Corporation\NVIDIA NvDLISR
C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.0\cuda\bin
也许还值得一提,以防万一..我使用Sublime Text 3进行开发,而不使用Anaconda.我发现在conda环境中对tensorflow进行更新有点麻烦,因此我现在仅使用Sublime Text. (我以前使用的是Anaconda(Spyder),但已从计算机上卸载了它.)
Maybe also worth to mention, just in case it might be relevant.. I use Sublime Text 3 for development and I do not use Anaconda. I find it a bit cumbersome to make updates on tensorflow in the conda environment so I just use Sublime Text right now. (I was using Anaconda (Spyder) previously but I uninstalled it from my computer.)
除了偶尔出现一些奇怪的警告,一切似乎都可以正常工作.但是,每当我运行fit
函数时,都会得到一个一致的警告.
Things seem to work fine except with some occasional strange warnings. But one consistent warning I get is the following whenever I run the fit
function.
E tensorflow/core/platform/default/device_tracer.cc:68] CUPTI error: CUPTI could not be loaded or symbol could not be found.
这就是我所说的fit函数:
And here is how I call the fit function:
history = model.fit(x=train_x,
y=train_y,
batch_size=BATCH_SIZE,
epochs=110,
verbose=2,
callbacks=[tensorboard, checkpoint, reduce_lr_on_plateau],
validation_data=(dev_x, dev_y),
shuffle=True,
class_weight=class_weight,
steps_per_epoch=None,
validation_steps=None)
我只是想知道为什么在运行时会看到 CUPTI Error
消息?它仅打印一次.是我需要修复的东西还是可以忽略的东西?此消息并没有告诉我要采取任何措施的具体内容.
I just wonder why I see the CUPTI Error
message during the run time? It is only printed out once. Is that something that I need to fix or is it something that can be ignored? This message does not tell anything concrete to me to be able to take any action.
推荐答案
以下是解决我的"问题的方法:
Here is what solved "my" problem:
我刚刚用tensorflow v 1.13.1
替换了tensorflow v 1.14
.不再有 CUPTI错误消息.甚至其他一些奇怪的警告/问题也已消失.所有问题显然都应具有特定的原因,但不幸的是,Tensorflow(多次)没有提供可理解的错误/警告消息,而这些错误/警告消息却给出了有助于解决问题的良好/公正的主意.而且我最终花了几个小时(甚至几天)来解决这些奇怪的问题,这极大地降低了我的生产率.
I just replaced my tensorflow v 1.14
with tensorflow v 1.13.1
. And no more CUPTI error messages. And even some other strange warnings / problems have disappeared. All issues should obviously have specific reasons but Tensorflow (many times) unfortunately does not provide understandable error/warning messages that give a good/fair idea that helps to solve the issue. And I end up spending hours (even days) on such strange problems, that reduces my productivity significantly.
对我来说(可以在这里分享)可能是对我的一个一般性学习,那就是我不应该急于将我的tensorflow安装升级到最新版本.最新的版本几乎永远不会稳定,每当我尝试一下时,我就花了大量的时间来解决由tensorflow引起的问题.糟糕的文档和错误消息使使用起来非常困难.
One general learning for me (that might be relevant to share here) is that I should not be in hurry to upgrade my tensorflow installation to the latest version of it. The latest one is almost never stable, whenever I made a try, I ended up spending significant amount of time on problems that are caused by tensorflow. Poor documentation and error messages make it very very difficult to work with.
如果有人有更好的答案,非常欢迎他/她对我在此问题中分享的问题发表自己的见解.
If anyone has a better answer, s/he is more than welcome to share his/her insights on the issue I shared in this question.
这篇关于Tensorflow CUDA-CUPTI错误:无法加载CUPTI或找不到符号的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!