Tensorflow:如何创建Pascal VOC样式图像 [英] Tensorflow: How to create a Pascal VOC style image

查看：167 发布时间：2020/11/27 3:14:53 python tensorflow image-segmentation

本文介绍了Tensorflow:如何创建Pascal VOC样式图像的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在努力在Tensorflow中实现语义分割网络，并且试图弄清楚如何在训练期间写出标签的摘要图像.我想以与 Pascal VOC数据集中使用的类别细分注释.

I'm working on implementing a semantic segmentation network in Tensorflow, and I'm trying to figure out how to write out summary images of the labels during training. I want to encode the images in a similar style to the class segmentation annotations used in the Pascal VOC dataset.

例如，假设我有一个网络，该网络以4个类的批次大小进行训练.网络最终预测的形状为[1, 3, 3, 4]

For example, let's assume I have a network that trains on a batch size of 1 with 4 classes. The networks final predictions have shape [1, 3, 3, 4]

基本上，我想获取输出预测并通过argmin运行以获得包含在输出中每个点上最有可能的类的张量:

Essentially I want to take the output predictions and run it through argmin to get a tensor containing the most likely class at each point in the output:

[[[0, 1, 3], 
  [2, 0, 1],
  [3, 1, 2]]]

带注释的图像使用255种颜色的调色板来编码标签.我有一个包含所有颜色三元组的张量:

The annotated images use a color palette of 255 colors to encode labels. I have a tensor containing all the color triples:

  [[  0,   0,   0],
   [128,   0,   0],
   [  0, 128,   0],
   [128, 128,   0],
   [  0,   0, 128],
   ...
   [224, 224, 192]]

我如何获得形状为[1, 3, 3, 3]的张量(单个3x3彩色图像)，该张量使用从argmin获得的值索引到调色板中?

How could I obtain a tensor of shape [1, 3, 3, 3] (a single 3x3 color image) that indexes into the color palette using the values obtained from argmin?

[[palette[0], palette[1], palette[3]],
 [palette[2], palette[0], palette[1]],
 [palette[3], palette[1], palette[2]]]

我可以轻松地在tf.py_func中包装一些numpy和PIL代码，但是我想知道是否有一种纯粹的Tensorflow方式来获得此结果.

I could easily wrap some numpy and PIL code in tf.py_func but I'm wondering if there is a pure Tensorflow way of obtaining this result.

对于那些好奇的人，这是我仅使用numpy的解决方案.它工作得很好，但是我仍然不喜欢tf.py_func的使用:

For those curious, this is the solution I got using just numpy. It works quite well, but I still dislike the use of tf.py_func:

import numpy as np
import tensorflow as tf


def voc_colormap(N=256):
    bitget = lambda val, idx: ((val & (1 << idx)) != 0)

    cmap = np.zeros((N, 3), dtype=np.uint8)
    for i in range(N):
        r = g = b = 0
        c = i
        for j in range(8):
            r |= (bitget(c, 0) << 7 - j)
            g |= (bitget(c, 1) << 7 - j)
            b |= (bitget(c, 2) << 7 - j)
            c >>= 3

        cmap[i, :] = [r, g, b]
    return cmap


VOC_COLORMAP = voc_colormap()


def grayscale_to_voc(input, name="grayscale_to_voc"):
    return tf.py_func(grayscale_to_voc_impl, [input], tf.uint8, stateful=False, name=name)


def grayscale_to_voc_impl(input):
    return np.squeeze(VOC_COLORMAP[input])

推荐答案

您可以使用 tf.gather_nd()，但是您将需要修改调色板的形状并登录以获取所需的图像，例如:

You can use tf.gather_nd(), but you will need to modify the shapes of the palette and logits to obtain the desired image, for example:

import tensorflow as tf
import numpy as np
import PIL.Image as Image

# We can load the palette from some random image in the PASCAL VOC dataset
palette = Image.open('.../VOC2012/SegmentationClass/2007_000032.png').getpalette()

# We build a random logits tensor of the requested size
batch_size = 1
height = width = 3
num_classes = 4
np.random.seed(1234)
logits = np.random.random_sample((batch_size, height, width, num_classes))
logits_argmax = np.argmax(logits, axis=3)  # shape = (1, 3, 3)
# array([[[3, 3, 0],
#         [1, 3, 1],
#         [0, 2, 0]]])

sess = tf.InteractiveSession()
image = tf.gather_nd(
    params=tf.reshape(palette, [-1, 3]),  # reshaped from list to RGB
    indices=tf.reshape(logits_argmax, [batch_size, -1, 1]))
image = tf.cast(tf.reshape(image, [batch_size, height, width, 3]), tf.uint8)
sess.run(image)
# array([[[[128, 128,   0],
#          [128, 128,   0],
#          [  0,   0,   0]],
#         [[128,   0,   0],
#          [128, 128,   0],
#          [128,   0,   0]],
#         [[  0,   0,   0],
#           [  0, 128,   0],
#           [  0,   0,   0]]]], dtype=uint8)

生成的张量可以直接馈送到 tf.summary.image( )，但是根据您的实现，您应该在摘要之前对其进行升采样.

The resulting tensor can be directly fed to a tf.summary.image(), but depending on your implementation you should upsample it before the summary.

这篇关于Tensorflow:如何创建Pascal VOC样式图像的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！

查看全文

Tensorflow:如何创建Pascal VOC样式图像 [英] Tensorflow: How to create a Pascal VOC style image

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

Tensorflow:如何创建Pascal VOC样式图像 [英] Tensorflow: How to create a Pascal VOC style image

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭