Brute force "bokeh" convolution and TDR avoidance


Problem description

I am performing a convolution that can take anywhere from about a second up to 16-20 seconds, depending on the size of the source image. The average time is about 2-4 seconds on a typical screen-resolution image.

I am experiencing TDRs, i.e. "the display device has stopped responding and has been reset", and I am exploring how to avoid them. Simply recovering is not an option because I do not "own" the memory that the recovery process seems to de-allocate.

My algorithm multiplies nearby pixels by values in a kernel that is defined as a custom shape, such as a hexagon or octagon (like the shape of the iris in a camera lens system). The algorithm is therefore not separable (into independent x and y passes); it has to compute each pixel by brute force.
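For illustration only, the following is a minimal C++ AMP sketch of the kind of non-separable, brute-force kernel described above; it is not the poster's actual code. It assumes a single-channel float image and a dense square table of kernel weights (zero outside the hexagon/octagon aperture shape) of radius KernelRadius; the names BruteForceBokeh, kernelWeights, etc. are hypothetical.

#include <amp.h>
using namespace concurrency;

// Sketch only: inputImage/outputImage are single-channel float images, and
// kernelWeights is a (2*KernelRadius+1) x (2*KernelRadius+1) dense table that
// is zero outside the custom aperture shape.
void BruteForceBokeh(const array_view<const float, 2>& inputImage,
                     const array_view<float, 2>& outputImage,
                     const array_view<const float, 2>& kernelWeights,
                     int KernelRadius)
{
    const int rows = inputImage.extent[0];
    const int cols = inputImage.extent[1];

    parallel_for_each(outputImage.extent,
        [=](index<2> idx) restrict(amp)
    {
        float sum = 0.0f;
        float weightTotal = 0.0f;

        // Non-separable: walk the full 2D neighborhood for every output pixel.
        for (int ky = -KernelRadius; ky <= KernelRadius; ky++)
        {
            for (int kx = -KernelRadius; kx <= KernelRadius; kx++)
            {
                // Clamp to the image edges so neighboring reads stay in bounds.
                int r = idx[0] + ky;
                int c = idx[1] + kx;
                r = r < 0 ? 0 : (r >= rows ? rows - 1 : r);
                c = c < 0 ? 0 : (c >= cols ? cols - 1 : c);

                float w = kernelWeights(ky + KernelRadius, kx + KernelRadius);
                sum += w * inputImage(r, c);
                weightTotal += w;
            }
        }

        outputImage[idx] = (weightTotal > 0.0f) ? (sum / weightTotal) : inputImage[idx];
    });
}

The cost per output pixel is on the order of (2*KernelRadius+1)^2 reads, which is why a large kernel on a large image can keep a single parallel_for_each call running long enough to trip the TDR timeout.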

My first thought was to separate the algorithm into strips, such as 90 scan lines at a time. I ran into several problems in this regard, and I never confirmed that simply making multiple calls to parallel_for_each was enough to solve them. I need help understanding the problem, and also solving various issues related to the "strip" method I tried.

The first problem I had was this: a call to parallel_for_each requires an extent, and I do not know how to define an extent that covers only a portion of an array (scan lines 90-180 out of 0-1000, for example). In other words, I want to pass the full array view but only create threads for a portion of it.

I tried making multiple array views, but since my convolution accesses neighboring pixels, this did not work: pixels outside a given array view did not exist in GPU memory.

I could pass additional parameters to parallel_for_each to control which part of the strip I want to work on, but that would mean copying the full amount of memory back and forth to and from the GPU many times, and I still don't know how to create threads for only part of the array view.

So I hope I have explained all of this correctly and clearly. Somebody please slap me if I'm heading in the wrong direction here, and please don't just tell me to google it, because this is a complex problem with no obvious solution. Thanks in advance for the help.

Recommended answer

DanDan,

I've done exactly this (splitting the problem into scan-line strips) many times in AMP without any trouble. I've done something like the following:

1) Declare your array_views with a 2D extent expressing the size of the image:

extent<2> imageExtent(ImageRows, ImageColumns);
array_view<const int, 2> inputImage(imageExtent, &inputImageIntBuffer[0]);
array_view<int, 2> processedImage(imageExtent, &processedImageIntBuffer[0]);

2) Declare a compute extent based on the size of the portion of the image you wish to process:

extent<2> computeExtent(NumScanlines, ImageColumns);

3) Loop over the number of image portions required to process the entire image (there may be a cleanup kernel call at the end if the image can't be divided evenly by the portion size you are processing):

int NumIterations = ImageRows / NumScanlines;

for (int x = 0; x < NumIterations; x++)
{
    parallel_for_each(computeExtent, [=](index<2> computeIndex) restrict(amp) { ... });
}
processedImage.synchronize();

Don't forget, at the top of your kernel, to offset the scan-line position of "computeIndex" to account for the loop iteration, i.e.:

int CurrentImageColumn = computeIndex[1];
int CurrentImageRow = computeIndex[0] + (x * NumScanlines);
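
For concreteness, here is a minimal end-to-end sketch of steps 1-3 combined; it is not the exact code from the answer. The kernel body is only a placeholder copy that the real brute-force convolution would replace, the wrapper name ProcessInStrips and the discard_data() call are additions for this sketch, and the final pass shows one way to handle the leftover rows (the "cleanup kernel call" mentioned in step 3) when ImageRows is not a multiple of NumScanlines.

#include <amp.h>
#include <vector>
using namespace concurrency;

// Sketch only: buffers hold one int per pixel in row-major order; the
// placeholder kernel body below stands in for the real convolution.
void ProcessInStrips(const std::vector<int>& inputImageIntBuffer,
                     std::vector<int>& processedImageIntBuffer,
                     int ImageRows, int ImageColumns, int NumScanlines)
{
    extent<2> imageExtent(ImageRows, ImageColumns);
    array_view<const int, 2> inputImage(imageExtent, &inputImageIntBuffer[0]);
    array_view<int, 2> processedImage(imageExtent, &processedImageIntBuffer[0]);
    processedImage.discard_data();          // write-only: skip the copy to the GPU

    extent<2> computeExtent(NumScanlines, ImageColumns);
    int NumIterations = ImageRows / NumScanlines;

    for (int x = 0; x < NumIterations; x++)
    {
        parallel_for_each(computeExtent,
            [=](index<2> computeIndex) restrict(amp)
        {
            // Offset the strip-local row into full-image coordinates.
            int CurrentImageRow    = computeIndex[0] + (x * NumScanlines);
            int CurrentImageColumn = computeIndex[1];

            // Placeholder body: the real brute-force convolution would read
            // inputImage around (CurrentImageRow, CurrentImageColumn) here.
            processedImage(CurrentImageRow, CurrentImageColumn) =
                inputImage(CurrentImageRow, CurrentImageColumn);
        });
    }

    // Cleanup pass for rows left over when ImageRows isn't a multiple of NumScanlines.
    int RemainderRows = ImageRows % NumScanlines;
    if (RemainderRows > 0)
    {
        int RowOffset = NumIterations * NumScanlines;
        parallel_for_each(extent<2>(RemainderRows, ImageColumns),
            [=](index<2> computeIndex) restrict(amp)
        {
            int CurrentImageRow    = computeIndex[0] + RowOffset;
            int CurrentImageColumn = computeIndex[1];
            processedImage(CurrentImageRow, CurrentImageColumn) =
                inputImage(CurrentImageRow, CurrentImageColumn);
        });
    }

    processedImage.synchronize();           // copy the result back to the CPU buffer
}

The idea is that each parallel_for_each submission now covers only NumScanlines rows, so each individual kernel launch runs for a fraction of the full-image time, which keeps the work per submission below the TDR limit, while the single synchronize() at the end avoids copying the image back and forth between strips.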

This strategy is very fast, since the array_views stay resident on the GPU for the duration of the loop, and the device driver compiles the kernel the first time it encounters it in the loop and then runs it at full speed thereafter.

-L

