What is the fastest way to memset() a GPU buffer with OpenCL?

Views: 248 Published: 2020/5/20 18:52:37 Tags: performance opencl memset

Problem description

I'm using OpenCL, and I need to memset() some array in global device memory. CUDA has a memset()-like API function, but OpenCL does not. I read this, where I found two possible alternatives:

1. Using memset() on the host with some scratch buffer, then clEnqueueWriteBuffer() to copy that to the buffer on the device.
2. Enqueueing the following kernel:

__kernel void memset_uint4(
    __global  uint4* mem,
    __private uint4  val) 
{
    mem[get_global_id(0)] = val; 
}
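
The kernel above writes one uint4 (16 bytes) per work-item, so the host has to turn a memset()-style byte value and byte count into a uint4 value and a global work size. A minimal sketch of that arithmetic in plain C follows. The helper names are hypothetical and not part of OpenCL, and the sketch assumes the buffer size is a multiple of 16 bytes:

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (not an OpenCL API): repeat one byte across a 32-bit
   word, so that a uint4 built from four such words matches the byte fill
   that memset() would produce. */
static uint32_t replicate_byte(uint8_t value)
{
    return (uint32_t)value * 0x01010101u;
}

/* Hypothetical helper: number of work-items memset_uint4 needs, one per
   16-byte uint4 element. The kernel cannot write a partial element, so this
   sketch requires the byte count to be a multiple of 16. */
static size_t memset_uint4_global_size(size_t num_bytes)
{
    assert(num_bytes % 16 == 0);
    return num_bytes / 16;
}
```

The replicated word would be used for all four components of val, and the computed count passed as the global work size to clEnqueueNDRangeKernel(). A buffer whose size is not a multiple of 16 bytes would need separate handling for the tail, which this sketch does not cover.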

Which is better? Or rather, under which circumstances/for which platforms is one better than the other?

Note: If the special case of zeroing memory merits special treatment, that would be nice to know too.
