Is it possible for either Microsoft Computer Vision API or Google's Cloud Vision API to get a location for objects?

Views: 24 | Posted: 2021/11/15 3:34:09 | Tags: api computer-vision microsoft-cognitive google-cloud-vision


Question

I am trying to develop an application that needs to know the location of tagged objects in an image. Knowing that there is a "piano" in an image is not enough; I need to know where that piano is in the image.

Both Microsoft's Computer Vision API and Google's Cloud Vision API provide some form of cropping suggestion or smart thumbnail generation service, which leads me to think that the location of certain objects is being detected. Is there a way to get that information (such as a bounding box around each detected object) from either API?

I understand that both APIs can return the location of faces detected in an image. However, I am looking for the location and size of every object in an image: cars, pianos, trees, people... anything.
