Microsoft 计算机视觉 API 或 Google 的 Cloud Vision API 是否可以获取对象的位置? [英] Is it possible for either Microsoft Computer Vision API or Google's Cloud Vision API to get a location for objects?

查看:24
本文介绍了Microsoft 计算机视觉 API 或 Google 的 Cloud Vision API 是否可以获取对象的位置?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我正在尝试开发一个需要知道图像中标记对象位置的应用程序.知道图像中有钢琴"是不够的,我需要知道钢琴在图像中的位置.

I am trying to develop an application that needs to know the location of tagged objects in an image. Knowing that there is a "piano" in an image is not enough, I need to know where that piano is in the image.

Microsoft 的计算机视觉 API 和 Google 的 Cloud Vision API 都提供了某种形式的裁剪建议/智能缩略图生成服务,这让我认为正在检测某些对象的位置 - 但是有没有办法获取该信息(例如每个检测到的对象周围的边界框)来自 Microsoft 的计算机视觉 API 还是 Google 的 Cloud Vision API?

Both Microsoft's Computer Vision API and Google's Cloud Vision API provide some form of cropping suggestion/smart thumbnail generation service which leads me to think that the location of certain objects is being detected - however is there a way to get that information (like a bounding box around each detected object) from either Microsoft's Computer Vision API or Google's Cloud Vision API?

我知道这两个 API 都可以返回图像中检测到的人脸的位置,但是我正在寻找图像中每个对象的位置和大小:汽车、钢琴、树木、人...任何东西.

I understand that both APIs can return the location of faces detected in an image, however I am looking for locations and sizes of every object in an image: cars, pianos, trees, people...anything.

推荐答案

2020 更新:

这个问题已经有几年了,但是 Microsoft Azure 计算机视觉API 现在能够在图像中检测到的对象周围绘制边界框.这是一个 Python 示例.其他语言也可用.

This question is a few years old, but the Microsoft Azure Computer Vision API is now able to draw bounding boxes around objects that are detected in an image. Here is a sample in Python. Other languages are available as well.

计算机视觉文档:https://docs.microsoft.com/en-us/azure/cognitive-services/computer-vision/

计算机视觉 SDK:https://docs.microsoft.com/en-us/python/api/azure-cognitiveservices-vision-computervision/?view=azure-python

计算机视觉 API:https://https://westus.dev.cognitive.microsoft.com/docs/services/5cd27ec07268f6c679a3e641/operations/56f91f2e778daf14a499f21b.cognitive.microsoft.com/docs/services/5cd27ec07268f6c679a3e641/operations/56f91f2e778daf14a499f21b

这篇关于Microsoft 计算机视觉 API 或 Google 的 Cloud Vision API 是否可以获取对象的位置?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆