CUDA-nvidia驱动程序在运行时崩溃 [英] CUDA - nvidia driver crash while running

查看:84
本文介绍了CUDA-nvidia驱动程序在运行时崩溃的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我在CUDA中运行了N次弹跳的光线追踪器(每条光线会弹跳N次).

I run a raytracer in CUDA with N Bounces (each ray will bounce N times).

我使用openGL查看结果.

I view the results using openGL.

一旦N变小(1〜4),一切正常.一旦我使N大(〜10),每个线程(大约800x1000)就必须进行大量计算,并且当屏幕变黑时,然后重新打开,并注意我的nvidia崩溃.

once N is small (1~4) everything works great. once i make N big (~10) each thread (about 800x1000) has to do a lot of computing and this when the screen goes black, and than back on, with the note that my nvidia crash.

我在网上搜索,现在想想是什么原因导致了某种看门狗定时器,因为我在显示和计算时使用了相同的图形卡(计算需要2秒钟以上的时间,因此驱动程序会自行重置).

i searched online and think now that what cause it some sort of a watch-dog timer since i use the same graphic card for my display and my computing (computing takes more than 2 sec so the driver reset itself).

是否有一条命令可以使设备(gpu)的主机(cpu)持续使用时间长?

is there a command to make the host (cpu) WAIT for the device(gpu) for as long as it takes?

我需要做什么?我卡住了:(

what do i need to do? im stuck :(

谢谢

推荐答案

根据您的描述,您正在Windows Vista或Windows 7上运行.正如您所猜测的那样,Windows操作系统具有看门狗计时器.看门狗计时器仅适用于已连接显示器的GPU.

Based on your description, you are running on Windows Vista or Windows 7. Windows operating systems have a watchdog timer, as you guessed. The watchdog timer only applies to GPUs with displays attached.

最简单的解决方案是运行2个或更多GPU,并在不连接显示器的GPU上运行CUDA.

The easiest solution is to run 2 or more GPUs, and run CUDA on GPU(s) without a display attached.

您可以禁用看门狗定时器.有关更多详细信息,请参见此问题.但是,您应该谨慎行事-记住,当主显示GPU上的内核运行时间很长时,您将使计算机完全无响应(至少您看不到它在做什么),直到内核完成为止.

You can disable the watchdog timer. See this question for more details. However you should do so with care—remember that when you have a long running kernel on your primary display GPU you will make your computer completely unresponsive (at least you won't be able to see what it is doing) until the kernel completes.

这篇关于CUDA-nvidia驱动程序在运行时崩溃的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆