如何在同一图像中定位多个对象? [英] How to locate multiple objects in the same image?

查看:93
本文介绍了如何在同一图像中定位多个对象?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我是TensorFlow的新手.

I am a newbie in TensorFlow.

当前,我正在TensorFlow网站上测试一些分类示例卷积神经网络",它说明了如何将输入图像分类为预定义的类,但是问题是:我不知道如何找到多个同一张图片中的对象.例如,我有一个包含猫和狗的输入图像,并且希望我的图形在输出中显示图像中同时存在猫和狗".

Currently, I am testing some classification's examples "Convolutional Neural Network" in the TensorFlow website, and it explains how to classify input images into pre-defined classes, but the problem is: I can't figure out how to locate multiple objects in the same image. For example, I had an input image with a cat and dog and I want my graph to display in the output that there are both of them "a cat and a dog" in the image.

推荐答案

好问题.检测同一图像中的多个对象沸腾本质上是一个分割问题".两种不错且流行的算法是 YOLO(您只看一次)和SSD(单发多盒检测器).我在底部包括了指向它们的链接.

Great question. Detecting multiple objects in the same image boils is essentially a "segmentation problem". Two nice and popular algorithms are YOLO (You Only Look Once), and SSD(Single Shot Multibox Detector). I included links to them at the bottom.

我会看一些有关YOLO工作原理的视频,看看您是否理解了这个想法.然后阅读有关SSD的论文,看看是否能理解为什么该算法更快,更精确.

I would watch a few videos on how YOLO works, and see if you grasp the idea. Then read the paper on SSD, and see if you get why this algorithm is even faster and more precise.

这两种算法都是单次通过的:它们仅一次"查看图像,并预测它们发现的类别的边界框.有更精确的算法,但算法较慢(他们先选择要查看的许多斑点,然后仅在该斑点上运行分类器.结果是,它们对每个图像多次运行此分类器,这很慢).

Both algorithms are single-pass: they only look at the image "once" and predict bounding boxes for the categories they spot. There are more precise algorithms, but they are slower (they first pick many spots they want to look, and then run a classifier on only that spot. The result is that they run this classifier many times per image, which is slow).

正如您所说的,您是Tensorflow的新手,您可以尝试其他人制作的以下代码: https://github .com/thtrieu/darkflow .非常广泛的自述文件向您展示了如何开始使用自己的数据集.

As you stated you are a newbie to Tensorflow, you can try this code other people made: https://github.com/thtrieu/darkflow . The very extensive readme shows you how to get started on your own dataset.

祝您好运,如果您还有其他问题,或者这些算法不适合您的用例,请告诉我们.

Good luck, and let us know if you have other questions, or if these algorithms do not fit your use-case.

  • YOLO 9000 (https://pjreddie.com/darknet/yolo/)
  • SSD (Single shot multibox detector) (https://arxiv.org/abs/1512.02325)

这篇关于如何在同一图像中定位多个对象?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆