Tensorflow CUDA-CUPTI错误:无法加载CUPTI或找不到符号 [英] Tensorflow CUDA - CUPTI error: CUPTI could not be loaded or symbol could not be found

查看:2990
本文介绍了Tensorflow CUDA-CUPTI错误:无法加载CUPTI或找不到符号的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我使用Tensorflow v 1.14.0.我在Windows 10上工作.这是环境变量PATH中的外观:

I use the Tensorflow v 1.14.0. I work on Windows 10. And here is how relevant environment variables look in the PATH:

C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.0\bin
C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.0\libnvvp
C:\Program Files (x86)\NVIDIA Corporation\PhysX\Common
C:\Users\sinthes\AppData\Local\Programs\Python\Python37
C:\Users\sinthes\AppData\Local\Programs\Python\Python37\Scripts
C:\Program Files\NVIDIA Corporation\NVIDIA NvDLISR
C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v10.0\cuda\bin

也许还值得一提,以防万一..我使用Sublime Text 3进行开发,而不使用Anaconda.我发现在conda环境中对tensorflow进行更新有点麻烦,因此我现在仅使用Sublime Text. (我以前使用的是Anaconda(Spyder),但已从计算机上卸载了它.)

Maybe also worth to mention, just in case it might be relevant.. I use Sublime Text 3 for development and I do not use Anaconda. I find it a bit cumbersome to make updates on tensorflow in the conda environment so I just use Sublime Text right now. (I was using Anaconda (Spyder) previously but I uninstalled it from my computer.)

除了偶尔出现一些奇怪的警告,一切似乎都可以正常工作.但是,每当我运行fit函数时,都会得到一个一致的警告.

Things seem to work fine except with some occasional strange warnings. But one consistent warning I get is the following whenever I run the fit function.

E tensorflow/core/platform/default/device_tracer.cc:68] CUPTI error: CUPTI could not be loaded or symbol could not be found.

这就是我所说的fit函数:

And here is how I call the fit function:

history = model.fit(x=train_x,
                    y=train_y,
                    batch_size=BATCH_SIZE,
                    epochs=110,
                    verbose=2,
                    callbacks=[tensorboard, checkpoint, reduce_lr_on_plateau],
                    validation_data=(dev_x, dev_y),
                    shuffle=True,
                    class_weight=class_weight,
                    steps_per_epoch=None,
                    validation_steps=None)

我只是想知道为什么在运行时会看到 CUPTI Error 消息?它仅打印一次.是我需要修复的东西还是可以忽略的东西?此消息并没有告诉我要采取任何措施的具体内容.

I just wonder why I see the CUPTI Error message during the run time? It is only printed out once. Is that something that I need to fix or is it something that can be ignored? This message does not tell anything concrete to me to be able to take any action.

推荐答案

以下是解决我的"问题的方法:

Here is what solved "my" problem:

我刚刚用tensorflow v 1.13.1替换了tensorflow v 1.14.不再有 CUPTI错误消息.甚至其他一些奇怪的警告/问题也已消失.所有问题显然都应具有特定的原因,但不幸的是,Tensorflow(多次)没有提供可理解的错误/警告消息,而这些错误/警告消息却给出了有助于解决问题的良好/公正的主意.而且我最终花了几个小时(甚至几天)来解决这些奇怪的问题,这极大地降低了我的生产率.

I just replaced my tensorflow v 1.14 with tensorflow v 1.13.1. And no more CUPTI error messages. And even some other strange warnings / problems have disappeared. All issues should obviously have specific reasons but Tensorflow (many times) unfortunately does not provide understandable error/warning messages that give a good/fair idea that helps to solve the issue. And I end up spending hours (even days) on such strange problems, that reduces my productivity significantly.

对我来说(可以在这里分享)可能是对我的一个一般性学习,那就是我不应该急于将我的tensorflow安装升级到最新版本.最新的版本几乎永远不会稳定,每当我尝试一下时,我就花了大量的时间来解决由tensorflow引起的问题.糟糕的文档和错误消息使使用起来非常困难.

One general learning for me (that might be relevant to share here) is that I should not be in hurry to upgrade my tensorflow installation to the latest version of it. The latest one is almost never stable, whenever I made a try, I ended up spending significant amount of time on problems that are caused by tensorflow. Poor documentation and error messages make it very very difficult to work with.

如果有人有更好的答案,非常欢迎他/她对我在此问题中分享的问题发表自己的见解.

If anyone has a better answer, s/he is more than welcome to share his/her insights on the issue I shared in this question.

这篇关于Tensorflow CUDA-CUPTI错误:无法加载CUPTI或找不到符号的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆