Multi-GPU CUDA: run a kernel on one device and modify elements on the other?

Views: 294 | Posted: 2017/3/5 19:14:32 | Tags: cuda, gpu

Suppose I have multiple GPUs in a machine and a kernel running on GPU0.

With the UVA and P2P features of CUDA 4.0, can I modify the contents of an array on another device, say GPU1, while the kernel is running on GPU0?

The simpleP2P example in the CUDA 4.0 SDK does not demonstrate this.

It only demonstrates:

Peer-to-peer memcopies
A kernel running on GPU0 that reads input from a GPU1 buffer and writes output to a GPU0 buffer
A kernel running on GPU1 that reads input from a GPU0 buffer and writes output to a GPU1 buffer
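
For concreteness, here is a minimal sketch of the remaining case the question asks about: a kernel launched on GPU0 that writes into an array allocated on GPU1. It assumes a 64-bit build with UVA, at least two GPUs, and that `cudaDeviceCanAccessPeer` reports that GPU0 can reach GPU1. The kernel name `fillRemote`, the `CHECK` macro, and the buffer size are hypothetical names chosen for illustration; none of this comes from the simpleP2P sample.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Illustrative helper: report a failed CUDA runtime call and exit main().
#define CHECK(call)                                                   \
    do {                                                              \
        cudaError_t err_ = (call);                                    \
        if (err_ != cudaSuccess) {                                    \
            std::fprintf(stderr, "%s failed: %s\n", #call,            \
                         cudaGetErrorString(err_));                   \
            return 1;                                                 \
        }                                                             \
    } while (0)

// Hypothetical kernel: launched on GPU0, but 'remote' points to memory
// that was allocated on GPU1.
__global__ void fillRemote(int *remote, int n, int value)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) remote[i] = value + i;
}

int main()
{
    const int n = 1024;

    int deviceCount = 0;
    CHECK(cudaGetDeviceCount(&deviceCount));
    if (deviceCount < 2) {
        std::printf("Need at least two GPUs.\n");
        return 0;
    }

    // Ask whether GPU0 can directly address memory on GPU1.
    int canAccess = 0;
    CHECK(cudaDeviceCanAccessPeer(&canAccess, 0, 1));
    if (!canAccess) {
        std::printf("GPU0 cannot access GPU1 memory directly.\n");
        return 0;
    }

    // Allocate and zero the array on GPU1.
    int *d1 = nullptr;
    CHECK(cudaSetDevice(1));
    CHECK(cudaMalloc(&d1, n * sizeof(int)));
    CHECK(cudaMemset(d1, 0, n * sizeof(int)));
    CHECK(cudaDeviceSynchronize());  // make sure the memset has finished

    // Switch to GPU0, enable access to GPU1, and launch the kernel there.
    CHECK(cudaSetDevice(0));
    CHECK(cudaDeviceEnablePeerAccess(1, 0));  // flags argument must be 0
    fillRemote<<<(n + 255) / 256, 256>>>(d1, n, 100);
    CHECK(cudaGetLastError());
    CHECK(cudaDeviceSynchronize());

    // Copy the GPU1 buffer back to the host and check what GPU0 wrote.
    int h[n];
    CHECK(cudaMemcpy(h, d1, n * sizeof(int), cudaMemcpyDefault));
    int mismatches = 0;
    for (int i = 0; i < n; ++i)
        if (h[i] != 100 + i) ++mismatches;
    std::printf("mismatches: %d\n", mismatches);

    CHECK(cudaDeviceDisablePeerAccess(1));
    CHECK(cudaFree(d1));
    return mismatches == 0 ? 0 : 1;
}
```

Peer access is directional: the `cudaDeviceEnablePeerAccess(1, 0)` call above only lets the current device (GPU0) dereference GPU1 pointers. If GPU1 kernels also need to touch GPU0 memory, the same call has to be made with GPU1 as the current device and 0 as the peer.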