CUDA的Mersenne Twister用于任意数量的线程 [英] CUDA's Mersenne Twister for an arbitrary number of threads

查看：150 发布时间：2020/7/4 2:19:59 random cuda mersenne-twister curand

本文介绍了CUDA的Mersenne Twister用于任意数量的线程的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

Mersenne Twister(MT)随机数生成器的CUDA实现仅限于256和200个块/网格的最大线程/块数，即最大线程数是51200

CUDA's implementation of the Mersenne Twister (MT) random number generator is limited to a maximal number of threads/blocks of 256 and 200 blocks/grid, i.e. the maximal number of threads is 51200.

因此，无法启动使用MT的内核

Therefore, it is not possible to launch the kernel that uses the MT with

kernel<<<blocksPerGrid, threadsPerBlock>>>(devMTGPStates, ...)

其中

int blocksPerGrid = (n+threadsPerBlock-1)/threadsPerBlock;

和n是线程总数.

将MT用于threads > 51200的最佳方法是什么?

What is the best way to use the MT for threads > 51200?

我的方法是对blocksPerGrid和threadsPerBlock使用常量值，例如<<<128,128>>>，并在内核代码中使用以下代码:

My approach if to use constant values for blocksPerGrid and threadsPerBlock, e.g. <<<128,128>>> and use the following in the kernel code:

__global__ void kernel(curandStateMtgp32 *state, int n, ...) { 

    int id = threadIdx.x+blockIdx.x*blockDim.x;

    while (id < n) {

        float x = curand_normal(&state[blockIdx.x]);
        /* some more calls to curand_normal() followed
           by the algorithm that works with the data */

        id += blockDim.x*gridDim.x; 
    }
}

我不确定这是正确的方法还是会以不希望的方式影响MT状态?

I am not sure if this is the correct way or if it can influence the MT status in an undesired way?

谢谢.

CUDA的Mersenne Twister用于任意数量的线程 [英] CUDA's Mersenne Twister for an arbitrary number of threads

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

CUDA的Mersenne Twister用于任意数量的线程 [英] CUDA&#39;s Mersenne Twister for an arbitrary number of threads

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

CUDA的Mersenne Twister用于任意数量的线程 [英] CUDA's Mersenne Twister for an arbitrary number of threads

登录关闭