Object detection: ARKit vs CoreML

Question

I am building an ARKit application for iPhone. I need to detect a specific perfume bottle and display content depending on what is detected. I used the demo app from developer.apple.com to scan the real-world object and export an .arobject file that I can use in assets. It works, but because the bottle is made of glass, detection is very poor. It detects the bottle only at the location where the scan was made, and then only after 2 to 30 seconds, or it doesn't detect it at all. Merging scans doesn't improve the situation and sometimes makes it even worse. The merged result may also have a weird orientation.

What can I do to solve this?

If nothing can be done, will CoreML help me? I can take a lot of photos and train a model. What if I check each frame for a match against this model? Does such an approach have any chance?

Recommended answer

Because of glass refraction and varying lighting conditions, object recognition for perfume bottles (in both ARKit and CoreML) is one of the most difficult cases.

Look at the following picture: it shows three glass balls at different locations.

These glass balls differ in Fresnel IOR (index of refraction), environment, camera point of view, size and lighting conditions, but they have the same shape, material and colour.

So the best way to speed up recognition is to use an identical background/environment (for example, a monochromatic light-grey paper background), the same lighting conditions (location, intensity, color and direction of the light), good shape readability (thanks to specular highlights) and the same POV for your camera.

I know it is sometimes impossible to follow these tips, but they do work.
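
For the CoreML route raised in the question (checking each camera frame against a trained model), a minimal sketch with Vision could look like the following. It is only an illustration under stated assumptions: the model is assumed to be an image classifier you trained yourself, the label `"perfume_bottle"`, the type name `BottleFrameChecker` and the 0.8 threshold are hypothetical, and the `.right` orientation assumes a portrait app using the back camera.

```swift
import CoreML
import CoreVideo
import Vision

/// Checks a single camera frame against an image classifier.
/// Assumption: `model` is a classifier trained on photos of the bottle;
/// the target label and threshold are hypothetical and must match your model.
struct BottleFrameChecker {
    let request: VNCoreMLRequest
    let targetLabel: String
    let minimumConfidence: Float

    init(model: MLModel,
         targetLabel: String = "perfume_bottle",   // hypothetical class label
         minimumConfidence: Float = 0.8) throws {  // arbitrary threshold
        let visionModel = try VNCoreMLModel(for: model)
        let request = VNCoreMLRequest(model: visionModel)
        // Crop the frame to the model's square input instead of stretching it.
        request.imageCropAndScaleOption = .centerCrop
        self.request = request
        self.targetLabel = targetLabel
        self.minimumConfidence = minimumConfidence
    }

    /// Returns true when the top classification for this frame is the
    /// target label with at least the minimum confidence.
    func matches(_ pixelBuffer: CVPixelBuffer,
                 orientation: CGImagePropertyOrientation = .right) throws -> Bool {
        let handler = VNImageRequestHandler(cvPixelBuffer: pixelBuffer,
                                            orientation: orientation,
                                            options: [:])
        // perform(_:) runs synchronously; results are available afterwards.
        try handler.perform([request])
        guard let top = (request.results as? [VNClassificationObservation])?.first else {
            return false
        }
        return top.identifier == targetLabel && top.confidence >= minimumConfidence
    }
}
```

In an ARKit app, `matches(_:)` could be called with `frame.capturedImage` from the `ARSessionDelegate` method `session(_:didUpdate:)`, preferably on a background queue and not for every single frame, since `perform(_:)` blocks until inference finishes. The same caveats about glass, background and lighting apply to the training photos as to the scans.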
