The impact of the goto instruction on intra-warp divergence in CUDA code

Views: 343  Published: 2017/3/4 15:33:08  Tags: cuda, gpu, gpgpu, simd

Problem description

For simple intra-warp thread divergence in CUDA, my understanding is that the SM selects a re-convergence point (a PC address) and executes the instructions on both (or all) paths, disabling the effects of execution for the threads that did not take the path currently being executed.
For example, in the code below:

if( threadIdx.x < 16 ) {
    A:
    // do something.
} else {
    B:
    // do something else.
}
C:
// rest of code.

C is the re-convergence point. The warp scheduler issues the instructions at both A and B, disabling the instructions at A for the upper half-warp and the instructions at B for the lower half-warp. When execution reaches C, instructions are enabled again for all threads in the warp.
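The if/else case above can be made concrete with a small probe. This is only a sketch under stated assumptions: the kernel name `divergence_probe` and the host helper `lanes_that_took_a` are hypothetical, and a single warp of 32 threads is launched. It records which lanes came through A and shows that all 32 lanes take part again at C. It does not expose the hardware's re-convergence mechanism, and for branches this short the compiler may well use predication instead of a real branch.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical probe kernel, meant to be launched with one warp (32 threads).
__global__ void divergence_probe(unsigned *lanes_from_a)
{
    int took_a;
    if (threadIdx.x < 16) {
        // A: only lanes 0-15 have an effect here.
        took_a = 1;
    } else {
        // B: only lanes 16-31 have an effect here.
        took_a = 0;
    }
    // C: every thread of the warp reaches this point, so a full-mask
    // ballot is legal. Bit i of the result is set if lane i came through A.
    unsigned ballot = __ballot_sync(0xffffffffu, took_a);
    if (threadIdx.x == 0) {
        *lanes_from_a = ballot;
    }
}

// Hypothetical host helper: launches one warp and returns the ballot.
unsigned lanes_that_took_a()
{
    unsigned *d_mask = nullptr;
    unsigned h_mask = 0;
    cudaMalloc(&d_mask, sizeof(unsigned));
    divergence_probe<<<1, 32>>>(d_mask);
    cudaMemcpy(&h_mask, d_mask, sizeof(unsigned), cudaMemcpyDeviceToHost);
    cudaFree(d_mask);
    return h_mask;
}

int main()
{
    printf("lanes that took A: 0x%08x\n", lanes_that_took_a());
    return 0;
}
```

On a working GPU this should report 0x0000ffff, i.e. the lower half-warp went through A and the upper half-warp through B.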

My question is: will the SM handle code that includes the goto instruction properly, as in the case above? Or is there no guarantee that the chosen re-convergence point is the optimal one?
For instance, suppose I have the control flow below in my CUDA code, implemented using goto:

A:
// some code here.
B:
// some code here too.
if( threadIdx.x < 16 ) {
    C:
    // do something.
    goto A;
}
// do something else.
goto B;

Will the SM be smart enough to choose B as the re-convergence point for the intra-warp divergence caused by the if statement?
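For experimenting with this, the goto control flow can be turned into a kernel that terminates. The sketch below is not from the question as posted: the exit test on `rounds`, the counters, and the names `goto_flow` and `run_goto_flow` are assumptions added so the code compiles and finishes. It only shows that the per-thread results follow the goto semantics. Which re-convergence point the compiler and hardware pick affects performance, not these results, and this example does not reveal that choice.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// The control flow from the question, with a hypothetical exit added.
__global__ void goto_flow(int *out, int rounds)
{
    int a_count = 0;  // how many times this thread passed label A
    int b_count = 0;  // how many times this thread passed label B
A:
    ++a_count;                          // some code here.
B:
    ++b_count;                          // some code here too.
    if (b_count >= rounds) goto done;   // assumed exit, not in the question
    if (threadIdx.x < 16) {
        // C: do something.
        goto A;
    }
    // do something else.
    goto B;
done:
    out[threadIdx.x] = a_count * 100 + b_count;
}

// Hypothetical host helper: runs one warp and copies back 32 results.
void run_goto_flow(int *h_out, int rounds)
{
    int *d_out = nullptr;
    cudaMalloc(&d_out, 32 * sizeof(int));
    goto_flow<<<1, 32>>>(d_out, rounds);
    cudaMemcpy(h_out, d_out, 32 * sizeof(int), cudaMemcpyDeviceToHost);
    cudaFree(d_out);
}

int main()
{
    int h_out[32] = {0};
    run_goto_flow(h_out, 4);
    printf("lane 0: %d, lane 31: %d\n", h_out[0], h_out[31]);
    return 0;
}
```

With `rounds` set to 4, lanes 0-15 pass A and B four times each (404), while lanes 16-31 pass A once and B four times (104). To see how a particular compiler version actually laid out the branches, the generated machine code can be inspected with `cuobjdump -sass`.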
