Different loss values for test_on_batch and train_on_batch


Question

While trying to train a GAN for image generation I ran into a problem which I cannot explain.

When training the generator, the loss returned by train_on_batch drops straight to zero after just 2 or 3 iterations. After investigating I noticed some strange behavior of the train_on_batch method:

When I check the following:

noise = np.random.uniform(-1.0, 1.0, size=[batch_size, gen_noise_length])
predictions = GAN.stackedModel.predict(noise)

This returns values all close to zero, as I would expect since the generator is not trained yet.

But:

y = np.ones([batch_size, 1])
noise = np.random.uniform(-1.0, 1.0, size=[batch_size, gen_noise_length])
loss = GAN.stackedModel.train_on_batch(noise, y)

Here the loss is almost zero, even though my expected targets are obviously ones. When I run:

y = np.ones([batch_size, 1])
noise = np.random.uniform(-1.0, 1.0, size=[batch_size, gen_noise_length])
loss = GAN.stackedModel.test_on_batch(noise, y)

the returned loss is high, as I would expect.
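As a sanity check, the scale of that loss can be reproduced by hand: with all-ones targets, binary cross-entropy reduces to -log(p), so an untrained discriminator outputting values near zero must produce a large loss. A minimal numpy sketch (the prediction values 0.05 and 0.95 are made up for illustration, not taken from the actual model):

```python
import numpy as np

def binary_crossentropy(y_true, y_pred, eps=1e-7):
    # Mean binary cross-entropy over the batch; clip like Keras does internally
    y_pred = np.clip(y_pred, eps, 1 - eps)
    return float(np.mean(-(y_true * np.log(y_pred)
                           + (1 - y_true) * np.log(1 - y_pred))))

y = np.ones(4)                    # targets: all "real"
untrained = np.full(4, 0.05)      # discriminator output near zero
confident = np.full(4, 0.95)      # discriminator output near one

print(binary_crossentropy(y, untrained))  # ~3.0: high loss, like test_on_batch
print(binary_crossentropy(y, confident))  # ~0.05: near-zero loss, like train_on_batch
```

So a near-zero loss can only occur if the network's effective outputs during that call are close to one, which is the puzzle described above.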

What is going on with the train_on_batch method? I'm really clueless here...

Edit

My loss is binary cross-entropy, and I build the model like this:

from keras.layers import Input
from keras.models import Model
from keras.optimizers import RMSprop

def createStackedModel(self):
    # Build stacked GAN model: generator followed by discriminator
    gan_in = Input([self.noise_length])
    H = self.genModel(gan_in)
    gan_V = self.disModel(H)
    GAN = Model(gan_in, gan_V)
    opt = RMSprop(lr=0.0001, decay=3e-8)
    GAN.compile(loss='binary_crossentropy', optimizer=opt, metrics=['accuracy'])
    return GAN

Edit 2

The generator is constructed by stacking several blocks like this one, each containing a BatchNormalization:

    self.G.add(UpSampling2D())
    self.G.add(Conv2DTranspose(int(depth/8), 5, padding='same'))
    self.G.add(BatchNormalization(momentum=0.5))
    self.G.add(Activation('relu'))

Edit 3

I uploaded my code to https://gitlab.com/benjamingraf24/DCGAN/. Apparently the problem results from the way I build the GAN network, so there must be something wrong in GANBuilder.py. However, I can't find it...

Answer

BatchNormalization layers behave differently during the training and testing phases.

During the training phase they use the mean and variance of the current batch's activations to normalize.

However, during the testing phase they use the moving mean and moving variance that they collected during training. Without enough prior training, these collected values can be far from the actual batch statistics, resulting in significantly different loss values.
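This difference can be illustrated without Keras at all. A minimal numpy sketch (the input distribution is made up; the moving statistics are the fresh mean=0, variance=1 that Keras initializes them to at the start of training):

```python
import numpy as np

eps = 1e-3
# Fake pre-BN activations: mean ~5, std ~2, far from standardized
x = np.random.RandomState(0).normal(loc=5.0, scale=2.0, size=(32, 1))

# Training mode: normalize with the statistics of the current batch
train_out = (x - x.mean(axis=0)) / np.sqrt(x.var(axis=0) + eps)

# Inference mode: normalize with the (barely updated) moving statistics
moving_mean, moving_var = 0.0, 1.0
test_out = (x - moving_mean) / np.sqrt(moving_var + eps)

print(train_out.mean(), train_out.std())  # ~0, ~1: well standardized
print(test_out.mean(), test_out.std())    # ~5, ~2: wildly different activations
```

The same weights therefore produce very different downstream activations in the two modes, which is exactly the train_on_batch vs. test_on_batch gap observed in the question.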

Refer to the Keras documentation for BatchNormalization. The momentum argument defines how fast the moving mean and moving variance adapt to the freshly collected batch statistics during training.
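To see how much momentum matters, here is a sketch of the moving-average update rule (the target batch mean of 5.0 and tolerance are made-up illustration values; the update formula is the standard exponential moving average used by BatchNormalization):

```python
import numpy as np

def batches_to_converge(momentum, batch_mean=5.0, tol=0.1):
    # Exponential moving average: moving = moving * m + batch_stat * (1 - m),
    # starting from the initial moving mean of 0
    moving = 0.0
    for step in range(1, 10000):
        moving = moving * momentum + batch_mean * (1 - momentum)
        if abs(moving - batch_mean) < tol:
            return step
    return None

print(batches_to_converge(0.99))  # hundreds of batches before the stats catch up
print(batches_to_converge(0.5))   # only a handful of batches
```

With a high momentum like the Keras default of 0.99, the moving statistics need hundreds of batches to approach the true activation statistics, so early test_on_batch calls normalize with badly stale values; a lower momentum (such as the 0.5 used in the question's generator blocks) adapts far faster.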

