Why is there a CL_DEVICE_MAX_WORK_GROUP_SIZE?

Question

I'm trying to understand the architecture of OpenCL devices such as GPUs, and I fail to see why there is an explicit bound on the number of work items in a local work group, i.e. the constant CL_DEVICE_MAX_WORK_GROUP_SIZE.

It seems to me that this should be taken care of by the compiler: if a kernel (one-dimensional for simplicity) is executed with a local work group size of 500 while its physical maximum is 100, and the kernel looks, for example, like this:

__kernel void test(__global float* input) {
    size_t i = get_global_id(0);
    someCode(i);                    /* someCode etc. stand for arbitrary per-item work */
    barrier(CLK_GLOBAL_MEM_FENCE);  /* barrier() requires a fence-flags argument */
    moreCode(i);
    barrier(CLK_GLOBAL_MEM_FENCE);
    finalCode(i);
}

then it could be converted automatically into an execution with work group size 100 using this kernel:

__kernel void test(__global float* input) {
    size_t i = get_global_id(0);
    /* each physical work item now covers 5 logical ones */
    someCode(5*i);
    someCode(5*i+1);
    someCode(5*i+2);
    someCode(5*i+3);
    someCode(5*i+4);
    barrier(CLK_GLOBAL_MEM_FENCE);
    moreCode(5*i);
    moreCode(5*i+1);
    moreCode(5*i+2);
    moreCode(5*i+3);
    moreCode(5*i+4);
    barrier(CLK_GLOBAL_MEM_FENCE);
    finalCode(5*i);
    finalCode(5*i+1);
    finalCode(5*i+2);
    finalCode(5*i+3);
    finalCode(5*i+4);
}
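
On the host side, only the launch configuration would change. A minimal sketch of what the coarsened launch might look like (the handles, the helper name, and the total problem size of 500 are illustrative, not part of the question):

#include <CL/cl.h>

/* Hypothetical coarsened launch: 500 logical work items are covered
   by 100 physical ones, each processing 5 consecutive items. */
void launch_coarsened(cl_command_queue queue, cl_kernel kernel) {
    size_t global_size = 100;  /* was 500 before the transformation */
    size_t local_size  = 100;  /* the physical per-group maximum */
    clEnqueueNDRangeKernel(queue, kernel, 1, NULL,
                           &global_size, &local_size, 0, NULL, NULL);
}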

However, it seems that this is not done by default. Why not? Is there a way to automate this process (other than writing a pre-compiler for it myself)? Or is there an intrinsic problem that can make my method fail on certain examples (and can you give me one)?

Answer

I think the origin of CL_DEVICE_MAX_WORK_GROUP_SIZE lies in the underlying hardware implementation.

Multiple threads run simultaneously on the compute units, and every one of them needs to keep state (for call, jmp, etc.). Most implementations use a stack for this, and if you look at the AMD Evergreen family there is a hardware limit on the number of available stack entries (every stack entry has sub-entries). This in essence limits the number of threads every compute unit can handle simultaneously.
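
Whatever the hardware specifics, the limit is queryable at runtime, and an individual kernel can have an even smaller effective limit once its resource usage is known. A minimal host-side sketch (error handling omitted; device and kernel are assumed to be valid handles obtained elsewhere):

#include <stdio.h>
#include <CL/cl.h>

/* Query the device-wide limit and the per-kernel limit, which
   can be smaller due to the kernel's register/resource usage. */
void print_group_limits(cl_device_id device, cl_kernel kernel) {
    size_t device_max = 0, kernel_max = 0;
    clGetDeviceInfo(device, CL_DEVICE_MAX_WORK_GROUP_SIZE,
                    sizeof(device_max), &device_max, NULL);
    clGetKernelWorkGroupInfo(kernel, device, CL_KERNEL_WORK_GROUP_SIZE,
                             sizeof(kernel_max), &kernel_max, NULL);
    printf("device max: %zu, kernel max: %zu\n", device_max, kernel_max);
}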

As for whether the compiler could do this to make it possible: it could work, but understand that it would mean recompiling the kernel, which isn't always possible. I can imagine situations where developers dump the compiled kernel for each platform in a binary format and ship it with their software, just for "not so open-source" reasons.
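
For illustration, a sketch of that dump-and-reload workflow using the standard program-binary APIs (single device assumed, error handling omitted; the helper names are mine):

#include <stdlib.h>
#include <CL/cl.h>

/* Extract the compiled binary of an already-built program
   (single device assumed; the caller frees the returned buffer). */
unsigned char *dump_binary(cl_program program, size_t *size) {
    clGetProgramInfo(program, CL_PROGRAM_BINARY_SIZES,
                     sizeof(*size), size, NULL);
    unsigned char *binary = malloc(*size);
    clGetProgramInfo(program, CL_PROGRAM_BINARIES,
                     sizeof(binary), &binary, NULL);
    return binary;
}

/* Recreate a program from a shipped binary instead of source. */
cl_program load_binary(cl_context context, cl_device_id device,
                       const unsigned char *binary, size_t size) {
    cl_int status, err;
    cl_program program = clCreateProgramWithBinary(
        context, 1, &device, &size, &binary, &status, &err);
    clBuildProgram(program, 1, &device, NULL, NULL, NULL);
    return program;
}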
