Multiple CUDA contexts for one device - any sense?


Question

I thought I had the grasp of this but apparently I do not :) I need to perform parallel H.264 stream encoding with NVENC from frames that are not in any of the formats accepted by the encoder, so I have the following code pipeline:

  • A callback informing that a new frame has arrived is called
  • I copy the frame to CUDA memory and perform the needed color space conversions (only the first cuMemcpy is synchronous, so I can return from the callback; all pending operations are pushed onto a dedicated stream)
  • I push an event onto the stream and have another thread waiting for it; as soon as it is set I take the CUDA memory pointer with the frame in the correct color space and feed it to the encoder
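The three steps above can be sketched with the driver API roughly as follows (a minimal illustration; the conversion-kernel launch and the NVENC call are placeholders, not real API names):

```cuda
#include <cuda.h>

typedef struct {
    CUstream    stream;  // dedicated stream per transcoder pipeline
    CUevent     done;    // signaled when the converted frame is ready
    CUdeviceptr src;     // device copy of the incoming frame
    CUdeviceptr nv12;    // frame converted to an encoder-friendly format
} FramePipe;

// Called from the "new frame arrived" callback.
void on_frame(FramePipe *p, const void *host_frame, size_t bytes)
{
    // Only this copy is synchronous, so the callback can return
    // while the remaining work is still queued on the stream.
    cuMemcpyHtoD(p->src, host_frame, bytes);

    // Color-space conversion launched asynchronously on p->stream
    // (kernel and launch configuration omitted):
    // cuLaunchKernel(convert_kernel, ..., p->stream, args, NULL);

    // Record an event so the encoder-feeding thread can wait on it.
    cuEventRecord(p->done, p->stream);
}

// Running on the thread that feeds the encoder.
void feed_encoder(FramePipe *p)
{
    cuEventSynchronize(p->done);  // block until the conversion finished
    // ...hand p->nv12 to NVENC here...
}
```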

For some reason I had the assumption that I need a dedicated context for each thread if I perform this pipeline in parallel threads. The code was slow, and after some reading I understood that context switching is actually expensive; I then came to the conclusion that it makes no sense, since a context owns the whole GPU and I therefore lock out any parallel processing from other transcoder threads.

Question 1: In this scenario am I good with using a single context and an explicit stream created on this context for each thread that performs the mentioned pipeline?

Question 2: Can someone enlighten me on what is the sole purpose of the CUDA device context? I assume it makes sense in a multiple GPU scenario, but are there any cases where I would want to create multiple contexts for one GPU?

Answer

Question 1: In this scenario am I good with using a single context and an explicit stream created on this context for each thread that performs the mentioned pipeline?

You should be fine with a single context.
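A minimal sketch of that setup, assuming the driver API (error checking omitted): one context shared by all transcoder threads, each thread owning its own stream.

```cuda
#include <cuda.h>

CUcontext ctx;  // the single context, shared by all threads

void init_once(void)
{
    CUdevice dev;
    cuInit(0);
    cuDeviceGet(&dev, 0);
    cuCtxCreate(&ctx, 0, dev);  // one context for the whole process
}

void transcoder_thread(void)
{
    CUstream stream;
    cuCtxSetCurrent(ctx);  // bind the shared context to this thread
    cuStreamCreate(&stream, CU_STREAM_NON_BLOCKING);

    // ...queue copies, conversion kernels, and events on `stream`;
    // work in different streams may overlap on the same GPU...

    cuStreamDestroy(stream);
}
```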

Question 2: Can someone enlighten me on what is the sole purpose of the CUDA device context? I assume it makes sense in a multiple GPU scenario, but are there any cases where I would want to create multiple contexts for one GPU?

The CUDA device context is discussed in the programming guide. It represents all of the state (memory map, allocations, kernel definitions, and other state-related information) associated with a particular process (i.e. associated with that particular process' use of a GPU). Separate processes will normally have separate contexts (as will separate devices), as these processes have independent GPU usage and independent memory maps.

If you have multi-process usage of a GPU, you will normally create multiple contexts on that GPU. As you've discovered, it's possible to create multiple contexts from a single process, but not usually necessary.
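If a single process does hold several contexts on one GPU, it has to switch between them explicitly. A sketch of what that looks like with the driver API (error checks omitted):

```cuda
#include <cuda.h>
#include <stddef.h>

void two_contexts_demo(CUdevice dev)
{
    CUcontext ctxA, ctxB;
    cuCtxCreate(&ctxA, 0, dev);  // creating a context also makes it current
    cuCtxCreate(&ctxB, 0, dev);  // ctxB is now the current context

    cuCtxPushCurrent(ctxA);      // switch this thread to ctxA
    // ...allocations and kernels here live in ctxA and are
    // invisible from ctxB...
    cuCtxPopCurrent(NULL);       // back to ctxB

    cuCtxDestroy(ctxB);
    cuCtxDestroy(ctxA);
}
```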

And yes, when you have multiple contexts, kernels launched in those contexts will require context switching to go from one kernel in one context to another kernel in another context. Those kernels cannot run concurrently.

The CUDA runtime API manages contexts for you; you normally don't interact with a CUDA context explicitly when using the runtime API. With the driver API, however, the context is explicitly created and managed.
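The contrast between the two APIs, as a sketch (error handling omitted):

```cuda
#include <cuda.h>
#include <cuda_runtime.h>

void runtime_style(void)
{
    void *ptr;
    cudaSetDevice(0);
    cudaMalloc(&ptr, 1024);  // primary context created implicitly
    cudaFree(ptr);
}

void driver_style(void)
{
    CUdevice dev;
    CUcontext ctx;
    CUdeviceptr dptr;
    cuInit(0);
    cuDeviceGet(&dev, 0);
    cuCtxCreate(&ctx, 0, dev);  // context created explicitly
    cuMemAlloc(&dptr, 1024);    // allocation lives in `ctx`
    cuMemFree(dptr);
    cuCtxDestroy(ctx);          // and destroyed explicitly
}
```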
