每个网格的最大块数:CUDA [英] Maximum blocks per grid:CUDA

查看:288
本文介绍了每个网格的最大块数:CUDA的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

每个内核启动时可以创建的网格中的最大块数是多少?这里有点困惑因为

What is the maximum number of blocks in a grid that can created per kernel launch? I am slightly confused here since

现在计算能力表这里说,在CUDA计算能力2.0中每个网格可以有65535个块。

Now the compute capability table here says that there can be 65535 blocks per grid dimemsion in CUDA compute capability 2.0.

这意味着块的总数= 65535 * 65535?

Does that mean the total number of blocks = 65535*65535?

或者说这意味着你最多可以将65535重新排列成65536块的1d网格
或sqrt(65535)的2d网格* sqrt(65535) ?

Or does it mean that you can rearrange at most 65535 into a 1d grid of 65536 blocks or 2d grid of sqrt(65535) * sqrt(65535) ?

谢谢。

推荐答案

65535每个网格的维度。在计算1.x卡上,支持1D和2D网格。在计算2.x卡上,还支持3D网格,因此65535,65535 x 65535和65535 x 65535 x 65535是Fermi(计算2.x)卡的限制。

65535 per dimension of the grid. On compute 1.x cards, 1D and 2D grids are supported. On compute 2.x cards, 3D grids are also supported, so 65535, 65535 x 65535, and 65535 x 65535 x 65535 are the limits for Fermi (compute 2.x) cards.

这篇关于每个网格的最大块数:CUDA的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆