Lowest overhead camera to CPU to GPU approach on Android

Question

My application needs to do some processing on live camera frames on the CPU, before rendering them on the GPU. There's also some other stuff being rendered on the GPU which is dependent on the results of the CPU processing; therefore it's important to keep everything synchronised so we don't render the frame itself on the GPU until the results of the CPU processing for that frame are also available.

The question is: what's the lowest-overhead approach for this on Android?

The CPU processing in my case just needs a greyscale image, so a YUV format where the Y plane is packed is ideal (and tends to be a good match for the native format of camera devices too). NV12, NV21 or fully planar YUV would all provide ideal low-overhead access to greyscale, so any of those would be preferred on the CPU side.
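
For illustration, here is a minimal sketch of pulling the packed Y plane out of a YUV_420_888 Image as delivered by an ImageReader (the helper name is my own, and the copy-out is optional: if the row stride is tight you could just as well process the ByteBuffer in place):

```java
import android.media.Image;
import java.nio.ByteBuffer;

final class LumaExtractor {
    /** Copies the Y (luma) plane of a YUV_420_888 Image into a tightly
     *  packed byte[] suitable for greyscale CPU processing. For this
     *  format the Y plane's pixel stride is guaranteed to be 1. */
    static byte[] extractLuma(Image image) {
        Image.Plane yPlane = image.getPlanes()[0];   // plane 0 is always Y
        ByteBuffer buf = yPlane.getBuffer();
        int width = image.getWidth();
        int height = image.getHeight();
        int rowStride = yPlane.getRowStride();       // may exceed width (padding)

        byte[] luma = new byte[width * height];
        if (rowStride == width) {
            buf.get(luma);                           // tightly packed: one bulk copy
        } else {
            for (int y = 0; y < height; y++) {
                buf.position(y * rowStride);         // skip the row padding
                buf.get(luma, y * width, width);
            }
        }
        return luma;
    }
}
```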

In the original Camera API, setPreviewCallbackWithBuffer() was the only sensible way to get data onto the CPU for processing. This delivered the Y plane separately, so it was ideal for the CPU processing. Getting that frame available to OpenGL for rendering in a low-overhead way was the more challenging aspect. In the end I wrote a NEON colour conversion routine to output RGB565 and just used glTexSubImage2d to get this available on the GPU. This was first implemented in the Nexus 1 timeframe, where even a 320x240 glTexSubImage2d call took 50ms of CPU time (poor drivers trying to do texture swizzling, I presume - this was significantly improved in a later system update).
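
For reference, the callback-with-buffer pattern looks roughly like this (a sketch assuming the preview size and display surface are already configured; processGreyscale() is a hypothetical stand-in for the CPU stage):

```java
import android.hardware.Camera;

class PreviewSetup {
    // Callback-with-buffer: we own the buffers, so there is no per-frame
    // allocation, and NV21's packed Y plane (the first width*height bytes
    // of `data`) is the greyscale image.
    static void startPreview(Camera camera, final int width, final int height) {
        final int bufSize = width * height * 3 / 2;   // NV21 is 12 bits per pixel
        for (int i = 0; i < 3; i++) {                 // small rotating buffer pool
            camera.addCallbackBuffer(new byte[bufSize]);
        }
        camera.setPreviewCallbackWithBuffer(new Camera.PreviewCallback() {
            @Override
            public void onPreviewFrame(byte[] data, Camera cam) {
                processGreyscale(data, width, height);  // hypothetical CPU stage
                cam.addCallbackBuffer(data);            // recycle the buffer
            }
        });
        camera.startPreview();
    }

    static void processGreyscale(byte[] nv21, int width, int height) {
        // ... CPU work on the Y plane: nv21[0 .. width*height-1] ...
    }
}
```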

Back in the day I looked into things like eglImage extensions, but they don't seem to be available or well documented enough for user apps. I had a little look into the internal Android GraphicBuffer classes, but ideally I want to stay in the world of supported public APIs.

The android.hardware.camera2 API had promise, with the ability to attach both an ImageReader and a SurfaceTexture to a capture session. Unfortunately I can't see any way of ensuring the right sequential pipeline here - holding off calling updateTexImage() until the CPU has finished processing is easy enough, but if another frame has arrived during that processing then updateTexImage() will skip straight to the latest frame. It also seems that with multiple outputs there are independent copies of the frames in each of the queues, which ideally I'd like to avoid.
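
For concreteness, attaching both outputs to one session looks roughly like this (a sketch with threading and error handling trimmed; every captured frame is delivered to both surfaces):

```java
import android.graphics.ImageFormat;
import android.graphics.SurfaceTexture;
import android.hardware.camera2.CameraAccessException;
import android.hardware.camera2.CameraCaptureSession;
import android.hardware.camera2.CameraDevice;
import android.hardware.camera2.CaptureRequest;
import android.media.ImageReader;
import android.view.Surface;
import java.util.Arrays;

class DualOutputSession {
    // One capture session, two outputs: ImageReader for CPU access,
    // SurfaceTexture for GPU rendering.
    static void start(CameraDevice device, SurfaceTexture texture,
                      int width, int height) throws CameraAccessException {
        ImageReader reader = ImageReader.newInstance(
                width, height, ImageFormat.YUV_420_888, /* maxImages= */ 3);
        texture.setDefaultBufferSize(width, height);
        Surface texSurface = new Surface(texture);

        final CaptureRequest.Builder request =
                device.createCaptureRequest(CameraDevice.TEMPLATE_PREVIEW);
        request.addTarget(reader.getSurface());
        request.addTarget(texSurface);

        device.createCaptureSession(
                Arrays.asList(reader.getSurface(), texSurface),
                new CameraCaptureSession.StateCallback() {
                    @Override public void onConfigured(CameraCaptureSession session) {
                        try {
                            session.setRepeatingRequest(request.build(), null, null);
                        } catch (CameraAccessException e) {
                            // camera went away; handle in real code
                        }
                    }
                    @Override public void onConfigureFailed(CameraCaptureSession session) {
                        // configuration rejected; handle in real code
                    }
                }, null /* handler: caller's looper */);
    }
}
```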

Ideally this is what I'd like:

  1. Camera driver fills a block of memory with the latest frame
  2. CPU gets a pointer to the data in that memory and reads out the Y data without a copy
  3. CPU processes the data and sets a flag in my code when the frame is ready (a sketch of this handshake follows the list)
  4. When beginning to render a frame, check whether a new frame is ready
  5. Call some API to bind that same memory as a GL texture
  6. When a newer frame is ready, release the buffer holding the previous frame back into the pool
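
A hypothetical sketch of the handshake in steps 3-4 might look like the following; note that it deliberately exposes the ordering problem described above, since updateTexImage() latches the newest queued frame rather than the one just processed:

```java
import android.graphics.SurfaceTexture;
import java.util.concurrent.atomic.AtomicBoolean;

class FrameSync {
    private final AtomicBoolean frameReady = new AtomicBoolean(false);

    // Processing thread: call once the CPU work for a frame is finished (step 3).
    void onCpuProcessingDone() {
        frameReady.set(true);
    }

    // GL thread: call at the start of each draw (step 4).
    void maybeLatchFrame(SurfaceTexture cameraTex) {
        if (frameReady.compareAndSet(true, false)) {
            // Caveat: updateTexImage() latches the *newest* queued frame,
            // not necessarily the one the CPU just finished with -- the
            // exact ordering problem described above.
            cameraTex.updateTexImage();
        }
    }
}
```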

I can't see a way of doing exactly that zero-copy style with the public API on Android, but what's the closest it's possible to get?

One crazy thing I tried that seems to work, but is not documented: the ANativeWindow NDK API can accept data in NV12 format, even though the appropriate format constant is not one of the ones in the public headers. That allows a SurfaceTexture to be filled with NV12 data by memcpy(), avoiding CPU-side colour conversion and any swizzling that happens driver-side in glTexImage2d. That is still an extra copy of the data, though, which feels like it should be unnecessary, and as it's undocumented it might not work on all devices. A supported sequential zero-copy Camera -> ImageReader -> SurfaceTexture pipeline, or equivalent, would be perfect.

Answer

The most efficient way to process video is to avoid the CPU altogether, but it sounds like that's not an option for you. The public APIs are generally geared toward doing everything in hardware, since that's what the framework itself needs, though there are some paths for RenderScript. (I'm assuming you've seen the Grafika filter demo that uses fragment shaders.)
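
For anyone who hasn't seen it, the Grafika-style approach keeps frames on the GPU by sampling the SurfaceTexture as an external OES texture and doing the work in the fragment shader. A rough sketch of what such shaders look like (not Grafika's exact source; the greyscale conversion is just an example filter):

```java
class ExternalTextureShaders {
    // Vertex shader: pass-through position, camera transform applied to the UVs.
    static final String VERTEX_SHADER =
            "uniform mat4 uTexMatrix;\n"
            + "attribute vec4 aPosition;\n"
            + "attribute vec4 aTextureCoord;\n"
            + "varying vec2 vTextureCoord;\n"
            + "void main() {\n"
            + "    gl_Position = aPosition;\n"
            + "    vTextureCoord = (uTexMatrix * aTextureCoord).xy;\n"
            + "}\n";

    // Fragment shader: samples the camera frame as an external OES texture;
    // the greyscale dot product is an example filter, not Grafika's code.
    static final String FRAGMENT_SHADER =
            "#extension GL_OES_EGL_image_external : require\n"
            + "precision mediump float;\n"
            + "varying vec2 vTextureCoord;\n"
            + "uniform samplerExternalOES sTexture;\n"
            + "void main() {\n"
            + "    vec4 c = texture2D(sTexture, vTextureCoord);\n"
            + "    float grey = dot(c.rgb, vec3(0.299, 0.587, 0.114));\n"
            + "    gl_FragColor = vec4(vec3(grey), 1.0);\n"
            + "}\n";
}
```

The texture behind sTexture is created with GLES11Ext.GL_TEXTURE_EXTERNAL_OES as its target, and uTexMatrix comes from SurfaceTexture.getTransformMatrix().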

Accessing the data on the CPU used to mean slow Camera APIs or working with GraphicBuffer and relatively obscure EGL functions (e.g. this question). The point of ImageReader was to provide zero-copy access to YUV data from the camera.

You can't really serialize Camera -> ImageReader -> SurfaceTexture as ImageReader doesn't have a "forward the buffer" API. Which is unfortunate, as that would make this trivial. You could try to replicate what SurfaceTexture does, using EGL functions to package the buffer as an external texture, but again you're into non-public GraphicBuffer-land, and I worry about ownership/lifetime issues of the buffer.

I'm not sure how the parallel paths help you (Camera2 -> ImageReader, Camera2 -> SurfaceTexture), as what's being sent to the SurfaceTexture wouldn't have your modifications. FWIW, it doesn't involve an extra copy -- in Lollipop or thereabouts, BufferQueue was updated to allow individual buffers to move through multiple queues.

It's entirely possible there are some fancy new APIs I haven't seen yet, but from what I know your ANativeWindow approach is probably the winner. I suspect you'd be better off with one of the Camera formats (YV12 or NV21) than NV12, but I don't know for sure.

FWIW, you will drop frames if your processing takes too long, but unless your processing is uneven (some frames take much longer than others) you'll have to drop frames no matter what. Getting into the realm of non-public APIs again, you could switch the SurfaceTexture to "synchronous" mode, but if your buffers fill up you're still dropping frames.
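
On the ImageReader side, the supported way to express a drop-to-newest policy is acquireLatestImage(), which discards any older pending frames. A minimal sketch (processGreyscale() again being a hypothetical stand-in for the CPU stage):

```java
import android.media.Image;
import android.media.ImageReader;

class DropToNewest {
    // acquireLatestImage() discards any older frames still queued in the
    // reader and returns only the most recent one (or null if none).
    static final ImageReader.OnImageAvailableListener LISTENER =
            new ImageReader.OnImageAvailableListener() {
        @Override
        public void onImageAvailable(ImageReader reader) {
            Image image = reader.acquireLatestImage();
            if (image == null) {
                return;                  // frame already consumed or dropped
            }
            try {
                processGreyscale(image); // hypothetical CPU stage
            } finally {
                image.close();           // return the buffer to the queue
            }
        }
    };

    static void processGreyscale(Image image) {
        // ... CPU work on image.getPlanes()[0] (the Y plane) ...
    }
}
```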
