将设备指针传递到cublasSnrm2时发生分段故障 [英] Segmentation fault when passing device pointer to cublasSnrm2

查看:430
本文介绍了将设备指针传递到cublasSnrm2时发生分段故障的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

下面的cublas代码给我们的错误:在cublasSnrm2(handle,row,dy,incy,de)的核心转储,你能给一些建议吗?

The code of cublas below give us the errors:core dumped while being at "cublasSnrm2(handle,row,dy,incy,de)",could you give some advice?

main.cu

#include <iostream>
#include "cublas.h"
#include "cublas_v2.h"
#include "helper_cuda.h"

using namespace std;

int main(int argc,char *args[])
{
    float y[10] = {1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0,1.0}; 

    int dev=0;
    checkCudaErrors(cudaSetDevice(dev));

    //cublas init
    cublasStatus stat;
    cublasInit();

    cublasHandle_t  handle;
    stat = cublasCreate(&handle);
    if (stat !=CUBLAS_STATUS_SUCCESS)
    {
       printf("cublas handle create failed!\n");
       cublasShutdown();
    }   
   float * dy,*de,*e;
   int incy = 1,ONE = 1,row = 10; 
   e = (float *)malloc(sizeof(float)*ONE);
   e[0]=0.0f;

   checkCudaErrors(cudaMalloc(&dy,sizeof(float)*row));
   checkCudaErrors(cudaMalloc(&de,sizeof(float)*ONE));
  checkCudaErrors(cudaMemcpy(dy,y,row*sizeof(float),cudaMemcpyHostToDevice));
  checkCudaErrors(cudaMemcpy(de,e,ONE*sizeof(float),cudaMemcpyHostToDevice));
   stat = cublasSnrm2(handle,row,dy,incy,de);
   if (stat !=CUBLAS_STATUS_SUCCESS)
   {
       printf("norm2 compute failed!\n");
       cublasShutdown();
   }
   checkCudaErrors(cudaMemcpy(e,de,ONE*sizeof(float),cudaMemcpyDeviceToHost));
   std::cout<<e[0]<<endl;
    return 0;
}

makefile如下:

makefile is below:

NVIDIA = $(HOME)/NVIDIA_CUDA-5.0_Samples
CUDA = /usr/local/cuda-5.0
NVIDINCADD = -I$(NVIDIA)/common/inc
CUDAINCADD = -I$(CUDA)/include 
CC = -L/usr/lib64/ -lstdc++

GCCOPT = -O2 -fno-rtti -fno-exceptions 
INTELOPT = -O3 -fno-rtti -xW -restrict -fno-alias
DEB = -g
NVCC = -G
ARCH = -arch=sm_35

bcg:main.cu
    nvcc $(DEB) $(NVCC)  $(ARCH) $(CC) -lm  $(NVIDINCADD) $(CUDAINCADD) -lcublas -I./ -o $(@) $(<) 
clean:
    rm -f bcg
    rm -f hyb

我的操作系统是linux redhat 6.2,CUDA版本是5.0,GPU是K20M。

My OS is linux redhat 6.2,CUDA's version is 5.0, GPU is K20M.

推荐答案

问题是这里:

cublasSnrm2(handle,row,dy,incy,de);

默认情况下,最后一个参数是主机指针。因此,或者将 e 传递给snrm2调用,而不是 de 或者这样做:

By default, the last parameter is a host pointer. So either pass e to the snrm2 call rather than de or do this:

cublasSetPointerMode(handle,CUBLAS_POINTER_MODE_DEVICE); 
stat = cublasSnrm2(handle,row,dy,incy,de);

如果要传递设备指针以存储结果,则需要将指针模式设置为device 。

The pointer mode needs to be set to device if you want to pass a device pointer to store the result.

这篇关于将设备指针传递到cublasSnrm2时发生分段故障的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆