是否有更好的方法来处理“按block_size的不可计数的数字"?在CUDA? [英] Is there a better way to process "undividable count of numbers by block_size" in CUDA?

查看：75 发布时间：2020/5/24 21:20:35 cuda parallel-processing

本文介绍了是否有更好的方法来处理“按block_size的不可计数的数字"?在CUDA?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我需要对N个数字的向量进行数据约简(查找k-最大数).问题是我事先不知道N(在编译之前)，并且在构造两个内核时，我不确定我是否做对了-一个带有(int)(N / block_size)块的内核，第二个带有一个块的内核. N % block_size线程.

I need to do data reduction (find k-max number) on vector of N numbers. The problem is I don't know the N beforehand (before compilation), and I am not sure if I'm doing it right when I'm constructing two kernels - one with (int)(N / block_size) blocks and the second kernel with one block of N % block_size threads.

是否有更好的方法来处理CUDA中按block_size进行的不可区分的"数字计数?

是否有更好的方法来处理“按block_size的不可计数的数字"?在CUDA? [英] Is there a better way to process "undividable count of numbers by block_size" in CUDA?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

是否有更好的方法来处理“按block_size的不可计数的数字"?在CUDA? [英] Is there a better way to process &quot;undividable count of numbers by block_size&quot; in CUDA?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

是否有更好的方法来处理“按block_size的不可计数的数字"?在CUDA? [英] Is there a better way to process "undividable count of numbers by block_size" in CUDA?

登录关闭