Maximum blocks per grid: CUDA
Question
What is the maximum number of blocks in a grid that can be created per kernel launch? I am slightly confused here, since the compute capability table says that there can be 65535 blocks per grid dimension in CUDA compute capability 2.0.
Does that mean the total number of blocks = 65535 * 65535?
Or does it mean that you can arrange at most 65535 blocks in total, either as a 1D grid of 65535 blocks or as a 2D grid of sqrt(65535) x sqrt(65535) blocks?
Thanks.
Recommended answer
The limit is 65535 per dimension of the grid. On compute 1.x cards, 1D and 2D grids are supported. On compute 2.x cards, 3D grids are also supported, so 65535, 65535 x 65535, and 65535 x 65535 x 65535 are the limits for 1D, 2D, and 3D grids on Fermi (compute 2.x) cards.
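To make the arithmetic in the answer concrete, here is a minimal host-side sketch in C++ (it should also compile as CUDA host code). It assumes the 65535-per-dimension limit quoted above for compute 1.x and 2.x devices. The names kMaxBlocksPerDim, max_total_blocks, Grid2D, and grid_for are hypothetical helpers made up for this illustration and are not part of the CUDA API.

```cpp
#include <cstdint>

// Per-dimension grid limit quoted in the answer (compute 1.x / 2.x).
// Assumption: other devices may report different limits.
constexpr std::uint64_t kMaxBlocksPerDim = 65535;

// Upper bound on the total number of blocks in a grid that uses
// `dims` dimensions: the per-dimension limit raised to the power `dims`.
constexpr std::uint64_t max_total_blocks(int dims) {
    return dims <= 0 ? 1 : kMaxBlocksPerDim * max_total_blocks(dims - 1);
}

// Hypothetical 2D grid shape, with x and y given in blocks.
struct Grid2D {
    std::uint64_t x;
    std::uint64_t y;
};

// Picks a 2D grid with x * y >= blocks and both sides within the
// per-dimension limit. Returns {0, 0} if the request is empty or does
// not fit in a 2D grid at all.
inline Grid2D grid_for(std::uint64_t blocks) {
    if (blocks == 0 || blocks > max_total_blocks(2)) {
        return {0, 0};
    }
    // Smallest number of rows that keeps each row within the limit.
    std::uint64_t y = (blocks + kMaxBlocksPerDim - 1) / kMaxBlocksPerDim;
    // Spread the blocks evenly over those rows, rounding up.
    std::uint64_t x = (blocks + y - 1) / y;
    return {x, y};
}
```

Under these assumptions, max_total_blocks(2) is 65535 * 65535 = 4294836225, so a request for 100000 blocks does not fit in a 1D grid but does fit in a 2D grid of 50000 x 2. Because x * y can exceed the requested count, a kernel launched this way would typically compute a linear block index and return early when it is out of range.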