TensorFlow tfrecords: tostring() changes dimension of image

Question

I have built a model to train a convolutional autoencoder in TensorFlow. I followed the instructions on Reading Data from the TF documentation to read in my own images of size 233 x 233 x 3. Here is my convert_to() function adapted from those instructions:

import os
import tensorflow as tf

# Standard helper wrappers from the TF "Reading Data" guide (referenced below):
def _int64_feature(value):
  return tf.train.Feature(int64_list=tf.train.Int64List(value=[value]))

def _bytes_feature(value):
  return tf.train.Feature(bytes_list=tf.train.BytesList(value=[value]))

def convert_to(images, name):
  """Converts a dataset to tfrecords."""
  num_examples = images.shape[0]
  rows = images.shape[1]
  cols = images.shape[2]
  depth = images.shape[3]

  filename = os.path.join(FLAGS.tmp_dir, name + '.tfrecords')
  print('Writing', filename)
  writer = tf.python_io.TFRecordWriter(filename)
  for index in range(num_examples):
    print(images[index].size)        # number of elements: 233 * 233 * 3 = 162867
    image_raw = images[index].tostring()
    print(len(image_raw))            # number of bytes in the serialized array
    example = tf.train.Example(features=tf.train.Features(feature={
        'height': _int64_feature(rows),
        'width': _int64_feature(cols),
        'depth': _int64_feature(depth),
        'image_raw': _bytes_feature(image_raw)}))
    writer.write(example.SerializeToString())
  writer.close()

When I print the size of the image at the start of the for loop, the size is 162867, but when I print after the .tostring() line, the size is 1302936. This causes problems down the road because the model thinks my input is 8x what it should be. Is it better to change the 'image_raw' entry in the Example to _int64_feature(image_raw) or to change the way I convert it to a string?
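
For reference, here is a quick sanity check (a sketch; it assumes images is the NumPy array passed to convert_to()) that I could add to compare element count, bytes per element, and serialized length:

# Sketch: compare element count, bytes per element, and serialized byte length.
print(images.dtype, images.itemsize)   # dtype and bytes per element
print(images[0].size)                  # number of array elements
print(len(images[0].tostring()))       # should equal size * itemsize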

Alternatively, the problem could be in my read_and_decode() function, e.g. the string is not properly being decoded or the example not being parsed...?

def read_and_decode(self, filename_queue):
    reader = tf.TFRecordReader()

    _, serialized_example = reader.read(filename_queue)
    features = tf.parse_single_example(
        serialized_example,
        features={
            'height': tf.FixedLenFeature([], tf.int64),
            'width': tf.FixedLenFeature([], tf.int64),
            'depth': tf.FixedLenFeature([], tf.int64),
            'image_raw': tf.FixedLenFeature([], tf.string)
      })

    # Convert from a scalar string tensor to a uint8 tensor
    image = tf.decode_raw(features['image_raw'], tf.uint8)

    # Reshape into a 233 x 233 x 3 image and apply distortions
    image = tf.reshape(image, (self.input_rows, self.input_cols, self.num_filters))

    image = data_sets.normalize(image)
    image = data_sets.apply_augmentation(image)

    return image

Thanks!

Answer

I may have some answers to your problem.

First, it's perfectly normal that your image is 8x longer after the .tostring() call. That method converts your array to raw bytes; the name is misleading because in Python 3 bytes are not the same as strings (they are in Python 2). I'd guess your image array is stored as int64 by default, so each element is encoded with 8 bytes (64 bits). In your example, the 162867 values of your image are encoded as 1302936 bytes.
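
As a minimal sketch (plain NumPy, independent of TFRecords) of how the dtype alone determines the serialized length:

import numpy as np

# A 233 x 233 x 3 image always has 162867 elements; the byte length depends on the dtype.
img_int64 = np.zeros((233, 233, 3), dtype=np.int64)
img_uint8 = np.zeros((233, 233, 3), dtype=np.uint8)

print(img_int64.size)               # 162867 elements
print(len(img_int64.tostring()))    # 1302936 bytes (8 bytes per element)
print(len(img_uint8.tostring()))    # 162867 bytes (1 byte per element)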

Concerning the error during parsing, I think it comes from the fact that you write your data as int64 (integers encoded with 64 bits, so 8 bytes each) and read them back as uint8 (unsigned integers encoded with 8 bits, so 1 byte each). The same integer has a different byte sequence depending on whether it's stored as int64 or uint8. Writing your images as bytes is good practice with tfrecord files, but you need to read them back with the same type they were written in.
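
For instance, the same value serializes to different byte sequences depending on the dtype (the exact bytes shown assume a little-endian machine):

import numpy as np

print(np.array([255], dtype=np.int64).tostring())   # b'\xff\x00\x00\x00\x00\x00\x00\x00'
print(np.array([255], dtype=np.uint8).tostring())   # b'\xff'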

For your code, try image = tf.decode_raw(features['image_raw'], tf.int64) instead.
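
A minimal sketch of how that fix would slot into read_and_decode() (the tf.cast to float32 is my assumption about what the normalization step expects):

# Read the raw bytes back with the dtype they were written in (int64, 8 bytes per value).
image = tf.decode_raw(features['image_raw'], tf.int64)
image = tf.reshape(image, (self.input_rows, self.input_cols, self.num_filters))
image = tf.cast(image, tf.float32)   # assumption: normalize/augment expect float values

Alternatively, casting the images to uint8 before serializing in convert_to() (e.g. images[index].astype(np.uint8).tostring(), assuming pixel values fit in 0-255) keeps the records 8x smaller and makes the existing tf.uint8 decode correct as written.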
