GPU上的分支预测 [英] Branch predication on GPU

查看：1354 发布时间：2017/3/4 14:26:21 cuda opencl gpu gpgpu

本文介绍了GPU上的分支预测的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我有一个关于GPU中分支预测的问题。据我所知，在GPU中，他们做分支的预测。

I have a question about branch predication in GPUs. As far as I know, in GPUs, they do predication with branches.

例如，我有一个这样的代码：

For example I have a code like this:

if (C)
 A
else
 B

40个周期，B需要50个周期来完成执行，如果假设一个翘曲，A和B都被执行，那么总共需要90个周期来完成这个分支吗？或者它们与A和B重叠，即，当A的一些指令被执行时，然后等待存储器请求，然后执行B的一些指令，然后等待存储器，等等？
感谢

so if A takes 40 cycles and B takes 50 cycles to finish execution, if assuming for one warp, both A and B are executed, so does it take in total 90 cycles to finish this branch? Or do they overlap A and B, i.e., when some instructions of A are executed, then wait for memory request, then some instructions of B are executed, then wait for memory, and so on? Thanks

GPU上的分支预测 [英] Branch predication on GPU

问题描述

推荐答案

相关文章

其它硬件开发最新文章

热门教程

热门工具

登录关闭

GPU上的分支预测 [英] Branch predication on GPU

问题描述

推荐答案

相关文章

其它硬件开发最新文章

热门教程

热门工具

登录 关闭

登录关闭