CUDA代码会损坏GPU吗? [英] Can CUDA code damage a GPU?
问题描述
在测试包含内存错误的一块CUDA时,我的屏幕被冻结。重新启动后,我无法再检测到显卡。我的代码是否有可能损坏卡?
While testing a piece of CUDA containing a memory bug, my screen got frozen. After rebooting I cannot detect anymore the graphics card. Is it possible that my code physically damaged the card?
这发生在Ubuntu 14.04下。我不知道卡的模型,因为我不能检测到它,但我记得它是一个相当新的。
This happened under Ubuntu 14.04. I don't know the model of the card, as I cannot detect it but I remember it is a fairly new one.
推荐答案
感谢所有的意见,我解决了这个问题。
Thanks to all the comments I solved the problem.
我将列出我承诺的操作。我不知道他们是否都有效果,但最终问题解决了。
I will list the actions that I undertook. I'm not sure if all of them had an effect but eventually the problem got solved.
首先我断开了显卡,没有它重新启动。之后,我再次插入卡并重新启动。我被抛到一个菜单,据说我在低图形模式下运行。我打开一个 tty
( ctrl + alt + 1 请使用此处说明安装Nvidia驱动程序。
First I disconnected the graphics card and rebooted without it. Afterwards I plugged the card again and rebooted. I was thrown to a menu where it was said that I was running in low graphics mode. I open a tty
(ctrl+alt+1) and tried to re-install the Nvidia drivers using the instructions here.
它最初失败了,因为新手的司机正在运行(我认为这是整个问题的主要罪魁祸首)。
It initially failed because the nouveau drivers were running (which I think is the main culprit of the whole problem).
我将此链接列入黑名单。
总之,创建文件/ etc / modprobe.d / blacklist-nouveau.conf
,然后添加:
In summary create the file /etc/modprobe.d/blacklist-nouveau.conf
and add:
blacklist nouveau
blacklist lbm-nouveau
options nouveau modeset=0
alias nouveau off
alias lbm-nouveau off
然后我重新启动。然后我的屏幕开始正常工作,但我不能启动 ubuntu
桌面。我重新安装了cuda驱动程序(有一些错误但不是致命错误)。
Then I rebooted. By then my screen started to work properly but I couldn't start the ubuntu
desktop. I reinstalled the cuda drivers (there were a few errors but not fatal errors).
然后我重新启动,我的屏幕重新工作。
Then I rebooted and my screen was working again.
回答主要问题:我没有通过测试 CUDA
代码损坏显卡。
Answering the main question: I did not damage the graphics card by testing CUDA
code.
这篇关于CUDA代码会损坏GPU吗?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!