Why is the constant memory size limited in CUDA?


Question

According to the "CUDA C Programming Guide", a constant memory access benefits only if the multiprocessor constant cache is hit (Section 5.3.2.4)¹. Otherwise there can be even more memory requests for a half-warp than in the case of a coalesced global memory read. So why is the constant memory size limited to 64 KB?

One more question, in order not to ask twice: as far as I understand, in the Fermi architecture the texture cache is combined with the L2 cache. Does texture usage still make sense, or are global memory reads cached in the same manner?

¹ Constant memory (Section 5.3.2.4):


The constant memory space resides in device memory and is cached in the constant cache mentioned in Sections F.3.1 and F.4.1.


For devices of compute capability 1.x, a constant memory request for a warp is first split into two requests, one for each half-warp, that are issued independently.


A request is then split into as many separate requests as there are different memory addresses in the initial request, decreasing throughput by a factor equal to the number of separate requests.


The resulting requests are then serviced at the throughput of the constant cache in case of a cache hit, or at the throughput of device memory otherwise.
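The splitting rule quoted above can be illustrated with two hypothetical kernels (the array name and size are illustrative, not from the original text): when all threads of a half-warp read the same constant address the access is a single request, but per-thread addresses serialize it.

```cuda
__constant__ float coeffs[256];

// Every thread reads the same address: one request, serviced at
// constant-cache throughput on a hit.
__global__ void uniform_read(float *out, int k) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    out[i] = coeffs[k] * out[i];          // same address for all threads
}

// Each thread reads a different address: the request is split into as
// many separate requests as there are distinct addresses, decreasing
// throughput by that factor.
__global__ void divergent_read(float *out) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    out[i] = coeffs[i % 256] * out[i];    // per-thread addresses
}
```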

Answer


The constant memory size is 64 KB for compute capability 1.0-3.0 devices. The cache working set is only 8KB (see the CUDA Programming Guide v4.2 Table F-2).


Constant memory is used by the driver, compiler, and variables declared __device__ __constant__. The driver uses constant memory to communicate parameters, texture bindings, etc. The compiler uses constants in many of the instructions (see disassembly).


Variables placed in constant memory can be read and written using the host runtime functions cudaMemcpyToSymbol() and cudaMemcpyFromSymbol() (see the CUDA Programming Guide v4.2 section B.2.2). Constant memory is in device memory but is accessed through the constant cache.
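A minimal host-side sketch of that runtime API (the variable name and sizes here are illustrative assumptions, not from the original answer):

```cuda
#include <cuda_runtime.h>
#include <cstdio>

// A small table placed in the 64 KB constant memory space.
__constant__ float table[64];

int main() {
    float host_data[64];
    for (int i = 0; i < 64; ++i) host_data[i] = 0.5f * i;

    // Copy host data into the __constant__ variable...
    cudaMemcpyToSymbol(table, host_data, sizeof(host_data));

    // ...and read it back through the symbol to verify.
    float check[64];
    cudaMemcpyFromSymbol(check, table, sizeof(check));
    printf("table[10] = %f\n", check[10]);
    return 0;
}
```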

On Fermi, the texture cache, constant cache, L1 cache, and I-Cache are all level 1 caches in or around each SM. All level 1 caches access device memory through the L2 cache.

The 64 KB constant limit is per CUmodule, which is a CUDA compilation unit. The concept of a CUmodule is hidden by the CUDA runtime but accessible through the CUDA Driver API.
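The per-CUmodule nature can be observed through the Driver API, which lets you query a `__constant__` symbol inside a specific loaded module (a hedged sketch; the module file name `kernels.cubin` and symbol name `table` are assumptions for illustration):

```cuda
#include <cuda.h>
#include <cstdio>

int main() {
    cuInit(0);
    CUdevice dev;  cuDeviceGet(&dev, 0);
    CUcontext ctx; cuCtxCreate(&ctx, 0, dev);

    // Each loaded module carries its own constant-memory allocation,
    // which is why the 64 KB limit applies per CUmodule.
    CUmodule mod;
    cuModuleLoad(&mod, "kernels.cubin");            // hypothetical module file

    CUdeviceptr ptr; size_t bytes;
    cuModuleGetGlobal(&ptr, &bytes, mod, "table");  // hypothetical symbol
    printf("constant symbol size: %zu bytes\n", bytes);

    cuModuleUnload(mod);
    cuCtxDestroy(ctx);
    return 0;
}
```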

