TensorFlow tfrecords: tostring() changes dimension of image


Problem description


I have built a model to train a convolutional autoencoder in TensorFlow. I followed the instructions on Reading Data from the TF documentation to read in my own images of size 233 x 233 x 3. Here is my convert_to() function adapted from those instructions:

import os
import tensorflow as tf

# Helper functions from the TF "Reading Data" instructions referenced above.
def _int64_feature(value):
  return tf.train.Feature(int64_list=tf.train.Int64List(value=[value]))

def _bytes_feature(value):
  return tf.train.Feature(bytes_list=tf.train.BytesList(value=[value]))

def convert_to(images, name):
  """Converts a dataset to tfrecords."""
  num_examples = images.shape[0]
  rows = images.shape[1]
  cols = images.shape[2]
  depth = images.shape[3]

  filename = os.path.join(FLAGS.tmp_dir, name + '.tfrecords')
  print('Writing', filename)
  writer = tf.python_io.TFRecordWriter(filename)
  for index in range(num_examples):
    print(images[index].size)
    image_raw = images[index].tostring()
    print(len(image_raw))
    example = tf.train.Example(features=tf.train.Features(feature={
        'height': _int64_feature(rows),
        'width': _int64_feature(cols),
        'depth': _int64_feature(depth),
        'image_raw': _bytes_feature(image_raw)}))
    writer.write(example.SerializeToString())
  writer.close()


When I print the size of the image at the start of the for loop, the size is 162867, but when I print after the .tostring() line, the size is 1302936. This causes problems down the road because the model thinks my input is 8x what it should be. Is it better to change the 'image_raw' entry in the Example to _int64_feature(image_raw) or to change the way I convert it to a string?


Alternatively, the problem could be in my read_and_decode() function, e.g. the string is not properly being decoded or the example not being parsed...?

def read_and_decode(self, filename_queue):
    reader = tf.TFRecordReader()

    _, serialized_example = reader.read(filename_queue)
    features = tf.parse_single_example(
        serialized_example,
        features={
            'height': tf.FixedLenFeature([], tf.int64),
            'width': tf.FixedLenFeature([], tf.int64),
            'depth': tf.FixedLenFeature([], tf.int64),
            'image_raw': tf.FixedLenFeature([], tf.string)
      })

    # Convert from a scalar string tensor to a uint8 tensor
    image = tf.decode_raw(features['image_raw'], tf.uint8)

    # Reshape into a 233 x 233 x 3 image and apply distortions
    image = tf.reshape(image, (self.input_rows, self.input_cols, self.num_filters))

    image = data_sets.normalize(image)
    image = data_sets.apply_augmentation(image)

    return image

Thanks!

Answer


I may have some answers to your problem.


First, it's perfectly normal that your image is 8x longer after the .tostring() method, which converts your array to raw bytes. The name is misleading because in Python 3 bytes and strings are distinct types (they were the same in Python 2). By default, I'd guess your image array has dtype int64, so each element is encoded with 8 bytes (64 bits). In your example, the 162867 elements of your image are encoded as 1302936 bytes...
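You can check this 8x factor with NumPy alone, independent of TensorFlow. The sketch below assumes an all-zeros array with the question's shape (233 x 233 x 3); tobytes() is the modern name for the tostring() method used in the question and produces identical output:

```python
import numpy as np

# Hypothetical image with the question's shape: 233 x 233 x 3.
img = np.zeros((233, 233, 3), dtype=np.int64)

print(img.size)                # 162867 elements
print(len(img.tobytes()))      # 1302936 bytes: 8 bytes per int64 element

# The same array cast to uint8 serializes to exactly one byte per element.
print(len(img.astype(np.uint8).tobytes()))  # 162867 bytes
```

So the 1302936 the question reports is just 162867 * 8, not a change in the image's dimensions.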


Concerning the error during parsing, I think it comes from the fact that you write your data as int64 (integers encoded with 64 bits, so 8 bytes) but read them back as uint8 (unsigned integers encoded with 8 bits, so 1 byte). The same integer has a different byte sequence depending on whether it's stored as int64 or uint8. Writing your image as bytes is good practice when using tfrecord files, but you need to read those bytes back with the same dtype they were written with.


For your code, try image = tf.decode_raw(features['image_raw'], tf.int64) instead.
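A minimal NumPy sketch of why the read dtype must match the write dtype; np.frombuffer plays the role of tf.decode_raw here, and the small 2 x 2 x 3 array is a stand-in for the real image:

```python
import numpy as np

img = np.arange(12, dtype=np.int64).reshape(2, 2, 3)
raw = img.tobytes()  # what _bytes_feature stores in the Example

# Decoding with the matching dtype recovers the original array.
decoded = np.frombuffer(raw, dtype=np.int64).reshape(2, 2, 3)
assert (decoded == img).all()

# Decoding the same bytes as uint8 yields 8x as many elements,
# so the subsequent reshape to the image dimensions would fail.
wrong = np.frombuffer(raw, dtype=np.uint8)
print(wrong.size)  # 96, not 12
```

The alternative fix is to cast the images to uint8 before calling tostring() in convert_to(), which keeps decode_raw(..., tf.uint8) correct and also makes the tfrecord file 8x smaller.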
