Keras VGG16 preprocess_input modes


Problem Description

I'm using the Keras VGG16 model.

I've seen that there is a preprocess_input method to use in conjunction with the VGG16 model. This method appears to call the preprocess_input method in imagenet_utils.py, which (depending on the case) calls the _preprocess_numpy_input method in imagenet_utils.py.

preprocess_input has a mode argument which expects "caffe", "tf", or "torch". If I'm using the model in Keras with the TensorFlow backend, should I absolutely use mode="tf"?

If yes, is this because the VGG16 model loaded by Keras was trained on images which underwent the same preprocessing (i.e. changing the input image's range from [0, 255] to [-1, 1])?

Also, should the input images at test time undergo this same preprocessing? I'm confident the answer to that last question is yes, but I would like some reassurance.

I would expect Francois Chollet to have done it correctly, but looking at https://github.com/fchollet/deep-learning-models/blob/master/vgg16.py, either he or I am wrong about using mode="tf".

Update

@FalconUA directed me to the VGG page at Oxford, which has a Models section with links for the 16-layer model. The information about the preprocessing_input mode argument (tf scaling to [-1, 1] and caffe subtracting mean values) is found by following the 16-layer model's information page link in that Models section. In the Description section it says:

"In the paper, the model is denoted as the configuration D trained with scale jittering. The input images should be zero-centered by mean pixel (rather than mean image) subtraction. Namely, the following BGR values should be subtracted: [103.939, 116.779, 123.68]."

Recommended Answer

The mode here is not about the backend, but rather about the framework the model was trained in and ported from. In the Keras documentation for VGG16, it is stated that:

These weights are ported from the ones released by VGG at Oxford.

So the VGG16 and VGG19 models were trained in Caffe and ported to TensorFlow, hence mode == 'caffe' here (values stay in the range 0 to 255, and the mean pixel [103.939, 116.779, 123.68] is then subtracted).
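As a minimal sketch of what 'caffe' mode does per pixel (the function and constant names here are hypothetical; the actual implementation lives in keras.applications.imagenet_utils):

```python
# Hypothetical sketch of 'caffe'-mode preprocessing: convert an RGB
# pixel to BGR and subtract the ImageNet mean pixel per channel.
# Mean BGR values come from the Oxford VGG model description quoted above.
IMAGENET_MEAN_BGR = [103.939, 116.779, 123.68]

def preprocess_caffe_pixel(rgb):
    """rgb: [R, G, B] values in the range 0-255."""
    bgr = [rgb[2], rgb[1], rgb[0]]  # RGB -> BGR channel reorder
    return [c - m for c, m in zip(bgr, IMAGENET_MEAN_BGR)]

print(preprocess_caffe_pixel([123.68, 116.779, 103.939]))  # -> [0.0, 0.0, 0.0]
```

Note that the output is not rescaled: zero-centered values can still span roughly [-124, 152], unlike 'tf' mode.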

Newer networks, like MobileNet and ShuffleNet, were trained in TensorFlow, so mode is 'tf' for them and the inputs are zero-centered in the range from -1 to 1.
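A minimal sketch of the 'tf'-mode scaling, under the assumption that it simply maps [0, 255] linearly onto [-1, 1] (function name is hypothetical):

```python
# Hypothetical sketch of 'tf'-mode preprocessing: linearly rescale a
# pixel value from [0, 255] to [-1, 1]. No channel reorder, no mean pixel.
def preprocess_tf_pixel(x):
    """x: a pixel value in the range 0-255."""
    return x / 127.5 - 1.0

print(preprocess_tf_pixel(0))      # -> -1.0
print(preprocess_tf_pixel(255))    # -> 1.0
print(preprocess_tf_pixel(127.5))  # -> 0.0
```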

