如何使用视觉识别API在图像上获取自定义对象的位置 [英] How to get a position of custom object on image using vision recognition api

查看:97
本文介绍了如何使用视觉识别API在图像上获取自定义对象的位置的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我知道有很多视觉识别API,例如Clarifai,Watson,Google Cloud Vision,Microsoft Cognitive Services,它们可以识别图像内容。这些服务的响应是包含不同标签的简单json,例如

I know there is a lot of vision recognition APIs such as Clarifai, Watson, Google Cloud Vision, Microsoft Cognitive Services which provide recognition of image content. The response of these services is simple json that contains different tags, for example

{ 
   man: 0.9969295263290405,
   portrait: 0.9949591159820557,
   face: 0.9261120557785034
}

问题是我不仅需要知道图像上的内容,还需要知道该对象的位置。其中一些API具有此类功能,但仅用于面部检测。

The problem is that I need to know not only what is on the image but also the position of that object. Some of those APIs have such feature but only for face detection.

所以有人知道是否有这样的API或我需要在OpenCV上为每个对象训练自己的haar级联。

So does anyone know if there is such API or I need to train own haar cascades on OpenCV for every object.

我将非常感谢您分享一些信息。

I will be very greatful for sharing some info.

推荐答案

您可以看看Wolfram Cloud / Mathematica。

You could take a look at Wolfram Cloud/Mathematica.

它能够检测图片中的对象位置。

It has the ability to detect object locations in a picture.

一些例子。

  • Detecting road signs.
  • Finding Waldo.
  • Object tracking in video.

这篇关于如何使用视觉识别API在图像上获取自定义对象的位置的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆