CUDA global memory load and store

Views: 110 Published: 2021/4/27 20:11:16 Tags: cuda gpu


Question

I am trying to hide global memory latency. Take the following code:

for (int i = 0; i < N; i++) {
    x = global_memory[i];

    // ... do some computation on x ...

    global_memory[i] = x;
}

I want to know whether loads from and stores to global memory are blocking, i.e., whether the next line does not run until the load or store has finished. For example, take the following code:

x_next = global_memory[0];
for (int i = 0; i < N; i++) {
    x = x_next;
    // Prefetch the next element; on the last iteration this reads global_memory[N].
    x_next = global_memory[i + 1];

    // ... do some computation on x ...

    global_memory[i] = x;
}

In this code, x_next is not used until the next iteration, so does the load of x_next overlap with the computation? In other words, which of the following figures shows what actually happens?
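To make the prefetch pattern concrete, here is a minimal sketch of it as a complete kernel. Everything beyond the loop shape is an assumption for illustration: the names `compute`, `prefetch_kernel`, and `run_prefetch_demo` are hypothetical, the arithmetic in `compute` is only a stand-in for the elided computation, and a grid-stride loop replaces the single-thread loop so that neighbouring threads touch neighbouring addresses. The sketch also guards the prefetch so the last iteration does not read past the end of the array. Whether the load really overlaps with the computation depends on how the compiler schedules the instructions; as far as I know, a warp generally stalls only when an instruction needs a register whose load has not yet returned, but the generated code (for example via `cuobjdump -sass`) is the place to confirm this.

```cuda
#include <cuda_runtime.h>
#include <vector>

// Hypothetical stand-in for "... do some computation on x ...".
__device__ float compute(float x) { return 2.0f * x + 1.0f; }

// Grid-stride loop with a one-element software prefetch per thread.
__global__ void prefetch_kernel(float* global_memory, int n)
{
    int stride = gridDim.x * blockDim.x;
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= n) return;

    float x_next = global_memory[i];          // first load for this thread
    for (; i < n; i += stride) {
        float x = x_next;
        // Issue the next load before the computation; x_next is not
        // consumed until the following iteration. Guarded to stay in bounds.
        if (i + stride < n) x_next = global_memory[i + stride];

        x = compute(x);
        global_memory[i] = x;
    }
}

// Host helper (assumed for the demo): fills an array with 0, 1, 2, ...,
// runs the kernel, and returns the result. Error checking is omitted.
std::vector<float> run_prefetch_demo(int n)
{
    std::vector<float> host(n);
    for (int i = 0; i < n; ++i) host[i] = static_cast<float>(i);

    float* dev = nullptr;
    cudaMalloc(&dev, n * sizeof(float));
    cudaMemcpy(dev, host.data(), n * sizeof(float), cudaMemcpyHostToDevice);

    prefetch_kernel<<<2, 64>>>(dev, n);       // 128 threads cover n via the stride loop

    // cudaMemcpy on the default stream waits for the kernel to finish.
    cudaMemcpy(host.data(), dev, n * sizeof(float), cudaMemcpyDeviceToHost);
    cudaFree(dev);
    return host;
}
```

Calling `run_prefetch_demo(1000)` from a host `main` should return an array in which element `i` holds `compute(i)`, which gives a quick check that the guarded prefetch does not change the results relative to the plain loop.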
