Computer restarts with large mini batches in TensorFlow

Question

I am running TensorFlow on Windows with a Titan X GPU (12 GB memory). When I try to train a network on 256x256x1 images with mini-batches larger than 50 images, my computer crashes and restarts automatically. With smaller mini-batches it runs fine. Any clues as to what might be causing this?

Recommended answer

I've seen similar problems discussed in some gaming forums, where the PC would shut down whenever the GPU was under heavy load. The cause was usually that the GPU was drawing more power than the power supply unit (PSU) could deliver. Check e.g. here or here. So it may be worth investigating whether your PSU is the culprit.

The program SpeedFan may help you debug this: it can show both voltages and temperature sensor readings, which would also tell you whether your PC is overheating (I've never used the tool myself and I'm not affiliated with it; I just found it online).
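If you would rather capture the readings in a file, so that the last values before a restart survive, a small logger is one option. The sketch below is not part of the original answer and swaps SpeedFan for NVIDIA's `nvidia-smi` command-line tool. It assumes `nvidia-smi` is on your PATH and that your driver reports the queried fields; some cards return `[Not Supported]` or `[N/A]` for power, which is why values are kept as plain strings. The function names, the log file name and the one-second interval are all illustrative choices.

```python
import csv
import os
import subprocess
import time

# Field names passed to nvidia-smi's --query-gpu option.
QUERY_FIELDS = [
    "timestamp",
    "power.draw",
    "power.limit",
    "temperature.gpu",
    "utilization.gpu",
    "memory.used",
]


def parse_sample(line):
    """Turn one CSV line printed by nvidia-smi into a dict keyed by field name."""
    values = [v.strip() for v in line.split(",")]
    if len(values) != len(QUERY_FIELDS):
        raise ValueError("unexpected nvidia-smi output: %r" % line)
    return dict(zip(QUERY_FIELDS, values))


def read_gpu_sample(gpu_index=0):
    """Query one GPU once; values stay strings because some may be unsupported."""
    out = subprocess.check_output(
        [
            "nvidia-smi",
            "-i", str(gpu_index),
            "--query-gpu=" + ",".join(QUERY_FIELDS),
            "--format=csv,noheader,nounits",
        ],
        text=True,
    )
    return parse_sample(out.strip().splitlines()[0])


def log_gpu(path="gpu_log.csv", interval_s=1.0):
    """Append one sample per interval until interrupted (Ctrl+C)."""
    write_header = not os.path.exists(path) or os.path.getsize(path) == 0
    with open(path, "a", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=QUERY_FIELDS)
        if write_header:
            writer.writeheader()
        while True:
            writer.writerow(read_gpu_sample())
            # Push each row to disk so it survives a sudden reset.
            f.flush()
            os.fsync(f.fileno())
            time.sleep(interval_s)
```

Calling `log_gpu()` in a separate console before starting training, then inspecting the last rows of `gpu_log.csv` after a crash, shows whether the reset coincided with power draw near the card's limit or with a high temperature. This only measures the GPU side: a log ending at high power draw is consistent with an overloaded PSU but does not prove it.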
