Pytorch: RuntimeError: reduce failed to synchronize: cudaErrorAssert: device-side assert triggered
Question
I am running into the following error when trying to train this on this dataset.
Since this is the configuration published in the paper, I am assuming I am doing something incredibly wrong.
This error occurs on a different image every time I try to run training.
C:/w/1/s/windows/pytorch/aten/src/THCUNN/ClassNLLCriterion.cu:106: block: [0,0,0], thread: [6,0,0] Assertion `t >= 0 && t < n_classes` failed.
Traceback (most recent call last):
File "C:\Program Files\JetBrains\PyCharm Community Edition 2019.1.1\helpers\pydev\pydevd.py", line 1741, in <module>
main()
File "C:\Program Files\JetBrains\PyCharm Community Edition 2019.1.1\helpers\pydev\pydevd.py", line 1735, in main
globals = debugger.run(setup['file'], None, None, is_module)
File "C:\Program Files\JetBrains\PyCharm Community Edition 2019.1.1\helpers\pydev\pydevd.py", line 1135, in run
pydev_imports.execfile(file, globals, locals) # execute the script
File "C:\Program Files\JetBrains\PyCharm Community Edition 2019.1.1\helpers\pydev\_pydev_imps\_pydev_execfile.py", line 18, in execfile
exec(compile(contents+"\n", file, 'exec'), glob, loc)
File "C:/Noam/Code/vision_course/hopenet/deep-head-pose/code/original_code_augmented/train_hopenet_with_validation_holdout.py", line 187, in <module>
loss_reg_yaw = reg_criterion(yaw_predicted, label_yaw_cont)
File "C:\Noam\Code\vision_course\hopenet\venv\lib\site-packages\torch\nn\modules\module.py", line 541, in __call__
result = self.forward(*input, **kwargs)
File "C:\Noam\Code\vision_course\hopenet\venv\lib\site-packages\torch\nn\modules\loss.py", line 431, in forward
return F.mse_loss(input, target, reduction=self.reduction)
File "C:\Noam\Code\vision_course\hopenet\venv\lib\site-packages\torch\nn\functional.py", line 2204, in mse_loss
ret = torch._C._nn.mse_loss(expanded_input, expanded_target, _Reduction.get_enum(reduction))
RuntimeError: reduce failed to synchronize: cudaErrorAssert: device-side assert triggered
Any ideas?
Answer
This kind of error generally occurs when using `NLLLoss` or `CrossEntropyLoss` while your dataset contains negative labels (or labels greater than or equal to the number of classes). That is also exactly the assertion you are hitting: `t >= 0 && t < n_classes` failed.
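A minimal sketch of how this assertion can be reproduced: on CPU, PyTorch raises a clear, synchronous error for an out-of-range label instead of the opaque device-side assert. All names here (`logits`, `bad_labels`) are illustrative, not from the OP's code.

```python
import torch
import torch.nn as nn

criterion = nn.CrossEntropyLoss()
logits = torch.randn(4, 3)               # batch of 4, 3 classes -> valid labels are 0..2
bad_labels = torch.tensor([0, 1, 2, 3])  # label 3 violates t < n_classes

caught = False
try:
    criterion(logits, bad_labels)
except (RuntimeError, IndexError) as e:  # the exception class differs across PyTorch versions
    caught = True
    print("out-of-range label:", e)
```

On the GPU the same call only surfaces later as the asynchronous `cudaErrorAssert` shown in the traceback; moving the batch to CPU (or setting the environment variable `CUDA_LAUNCH_BLOCKING=1`) makes the failure appear at the real call site.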
This won't occur for `MSELoss`, but the OP mentions that there is a `CrossEntropyLoss` somewhere, and thus the error occurs (the program crashes asynchronously on some other line). The solution is to clean the dataset and ensure that `t >= 0 && t < n_classes` is satisfied (where `t` represents the label).
Also, mind what each loss expects as input: `BCELoss` requires outputs in the range 0 to 1 (apply a `sigmoid`), while `NLLLoss` requires log-probabilities (apply a `log_softmax`). Note that neither activation is needed for `CrossEntropyLoss` or `BCEWithLogitsLoss`, because they implement the activation inside the loss function. (Thanks to @PouyaB for pointing this out.)