How to manage the same CUDA kernel call from multiple CPU threads?

Views: 1278 Published: 2017/3/5 19:31:50 Tags: multithreading cuda thread-safety gpu gpgpu


Problem description

I have a CUDA kernel that works fine when called from a single CPU thread. However, when the same kernel is called from multiple CPU threads (~100), most of the kernel calls do not seem to execute at all, because the results come out as all zeros. Can someone please guide me on how to resolve this problem?

In the current version I call cudaDeviceSynchronize() at the end of the kernel call. Would adding a synchronization call before cudaMalloc() and the kernel call be of any help in this case?
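Before adding more synchronization, it can help to confirm whether the launches succeed at all: a failed allocation or launch leaves the output buffer untouched unless the error code is read. The sketch below is only an illustration under assumptions. `markKernel`, `check`, and `runOnce` are hypothetical names that do not come from the question, and the launch configuration of 256 threads per block is arbitrary.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel for illustration: writes 1.0f into every element.
__global__ void markKernel(float* data, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] = 1.0f;
}

// Hypothetical helper: reports a CUDA error and returns false on failure.
static bool check(cudaError_t err, const char* what) {
    if (err != cudaSuccess) {
        std::fprintf(stderr, "%s failed: %s\n", what, cudaGetErrorString(err));
        return false;
    }
    return true;
}

// Allocates a buffer, launches the kernel, and checks every step.
bool runOnce(int n) {
    float* d = nullptr;
    if (!check(cudaMalloc((void**)&d, n * sizeof(float)), "cudaMalloc")) return false;

    markKernel<<<(n + 255) / 256, 256>>>(d, n);
    // cudaGetLastError reports launch failures (for example a bad configuration);
    // the return value of cudaDeviceSynchronize reports failures during execution.
    bool ok = check(cudaGetLastError(), "kernel launch")
           && check(cudaDeviceSynchronize(), "kernel execution");

    // Copy one element back to confirm the kernel really wrote to the buffer.
    float first = 0.0f;
    ok = ok && check(cudaMemcpy(&first, d, sizeof(float), cudaMemcpyDeviceToHost),
                     "cudaMemcpy");
    cudaFree(d);
    return ok && first == 1.0f;
}

int main() {
    return runOnce(1 << 20) ? 0 : 1;
}
```

If every call reports success in the single-threaded case but some fail with many threads, the printed error strings are usually a better starting point than extra synchronization calls.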

One more thing needs clarification: if two CPU threads execute the same cudaMalloc() command, will the later call overwrite the earlier one in GPU memory, or will each create its own allocation?
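On the cudaMalloc() question: every successful cudaMalloc() call returns its own distinct device allocation, and the CUDA runtime API may be called from several host threads at once. The sketch below is a minimal illustration of that behavior, not a diagnosis of the code in the question. `fillKernel`, `worker`, and the thread count of 8 are assumptions made for the example; giving each thread its own stream is one possible design, not a requirement.

```cuda
#include <cstdio>
#include <thread>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical kernel for illustration: fills the buffer with one value.
__global__ void fillKernel(int* out, int n, int value) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) out[i] = value;
}

// Work done by one CPU thread: own stream, own device buffer, own result check.
// Writes 1 to *ok only if every step succeeded and the data matches `id`.
void worker(int id, int n, int* ok) {
    *ok = 0;
    cudaStream_t stream;
    if (cudaStreamCreate(&stream) != cudaSuccess) return;

    int* d = nullptr;  // each thread receives its own allocation
    if (cudaMalloc((void**)&d, n * sizeof(int)) != cudaSuccess) {
        cudaStreamDestroy(stream);
        return;
    }

    std::vector<int> h(n, -1);
    fillKernel<<<(n + 255) / 256, 256, 0, stream>>>(d, n, id);
    cudaError_t err = cudaGetLastError();
    if (err == cudaSuccess)
        err = cudaMemcpyAsync(h.data(), d, n * sizeof(int),
                              cudaMemcpyDeviceToHost, stream);
    // Wait only for this thread's stream instead of the whole device.
    if (err == cudaSuccess) err = cudaStreamSynchronize(stream);

    if (err == cudaSuccess) {
        int match = 1;
        for (int i = 0; i < n; ++i)
            if (h[i] != id) match = 0;
        *ok = match;
    } else {
        std::fprintf(stderr, "thread %d: %s\n", id, cudaGetErrorString(err));
    }
    cudaFree(d);
    cudaStreamDestroy(stream);
}

int main() {
    const int numThreads = 8;  // arbitrary count chosen for the example
    const int n = 1 << 16;
    std::vector<int> ok(numThreads, 0);
    std::vector<std::thread> threads;
    for (int t = 0; t < numThreads; ++t)
        threads.emplace_back(worker, t, n, &ok[t]);
    for (auto& th : threads) th.join();

    int good = 0;
    for (int t = 0; t < numThreads; ++t) good += ok[t];
    std::printf("%d of %d threads produced correct results\n", good, numThreads);
    return good == numThreads ? 0 : 1;
}
```

Each thread reads back its own id from its own buffer, so a thread overwriting another thread's allocation would show up as a mismatch. What does need care is any host-side pointer or buffer shared between threads, which this sketch avoids by keeping everything local to `worker`.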

