如何从CUDA内核函数返回单个变量? [英] How to return a single variable from a CUDA kernel function?

查看:61
本文介绍了如何从CUDA内核函数返回单个变量?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我有一个CUDA搜索功能,可以计算一个单个变量.如何退回.

I have a CUDA search function which calculate one single variable. How can I return it back.

__global__ 
void G_SearchByNameID(node* Node, long nodeCount, long start,char* dest, long answer){
    answer = 2;
}

cudaMemcpy(h_answer, d_answer, sizeof(long), cudaMemcpyDeviceToHost);
cudaFree(d_answer);

对于这两行,我都会收到此错误:错误:"long"类型的参数与"const void *"类型的参数不兼容

for both of these lines I get this error: error: argument of type "long" is incompatible with parameter of type "const void *"

推荐答案

为此,我一直在使用 __ device __ 变量,这样您就不必为 cudaMalloc cudaFree ,您不必将指针作为内核参数传递,这样可以节省内核中的寄存器以进行引导.

I've been using __device__ variables for this purpose, that way you don't have to bother with cudaMalloc and cudaFree and you don't have to pass a pointer as a kernel argument, which saves you a register in your kernel to boot.

__device__ long d_answer;

__global__ void G_SearchByNameID() {
  d_answer = 2;
}

int main() {
  SearchByNameID<<<1,1>>>();
  typeof(d_answer) answer;
  cudaMemcpyFromSymbol(&answer, "d_answer", sizeof(answer), 0, cudaMemcpyDeviceToHost);
  printf("answer: %d\n", answer);
  return 0;
}

这篇关于如何从CUDA内核函数返回单个变量?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆