Dropout with densely connected layer


Question

I am using a DenseNet model for one of my projects and have some difficulties using regularization.

Without any regularization, both validation and training loss (MSE) decrease. The training loss drops faster though, resulting in some overfitting of the final model.

So I decided to use dropout to avoid overfitting. When using dropout, both validation and training loss decrease to about 0.13 during the first epoch and remain constant for about 10 epochs.

After that, both loss functions decrease in the same way as without dropout, resulting in overfitting again. The final loss value is in about the same range as without dropout.

So for me it seems like dropout is not really working.

If I switch to L2 regularization though, I am able to avoid overfitting, but I would rather use dropout as a regularizer.
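For reference, one way L2 weight decay can be attached in the same TF1-style API the code below uses is via a kernel regularizer on each convolution. This is only a minimal sketch: the conv_layer wrapper is not shown in the question, so its body is assumed here to wrap tf.layers.conv2d, and the scale of 1e-4 is just an example value.

import tensorflow as tf

# Hypothetical stand-in for the conv_layer wrapper called in the code below,
# extended with an L2 kernel regularizer (TF1-style API, as in the question).
def conv_layer(x, filter, kernel, layer_name, l2_scale=1e-4):
    return tf.layers.conv2d(
        inputs=x,
        filters=filter,
        kernel_size=kernel,
        padding='SAME',
        use_bias=False,
        kernel_regularizer=tf.contrib.layers.l2_regularizer(l2_scale),
        name=layer_name)

# The collected penalty terms would then be added to the data loss, e.g.:
# total_loss = mse_loss + tf.losses.get_regularization_loss()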

Now I am wondering if anyone has experienced that kind of behaviour?

I use dropout in both the dense block (bottleneck layer) and in the transition block (dropout rate = 0.5):

def bottleneck_layer(self, x, scope):
    # DenseNet bottleneck: BN -> ReLU -> 1x1 conv -> dropout, then BN -> ReLU -> 3x3 conv -> dropout
    with tf.name_scope(scope):
        x = Batch_Normalization(x, training=self.training, scope=scope+'_batch1')
        x = Relu(x)
        x = conv_layer(x, filter=4 * self.filters, kernel=[1,1], layer_name=scope+'_conv1')
        x = Drop_out(x, rate=dropout_rate, training=self.training)

        x = Batch_Normalization(x, training=self.training, scope=scope+'_batch2')
        x = Relu(x)
        x = conv_layer(x, filter=self.filters, kernel=[3,3], layer_name=scope+'_conv2')
        x = Drop_out(x, rate=dropout_rate, training=self.training)

        return x

def transition_layer(self, x, scope):
    # DenseNet transition: BN -> ReLU -> 1x1 conv -> dropout -> 2x2 average pooling
    with tf.name_scope(scope):
        x = Batch_Normalization(x, training=self.training, scope=scope+'_batch1')
        x = Relu(x)
        x = conv_layer(x, filter=self.filters, kernel=[1,1], layer_name=scope+'_conv1')
        x = Drop_out(x, rate=dropout_rate, training=self.training)
        x = Average_pooling(x, pool_size=[2,2], stride=2)

        return x
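The helper wrappers (Batch_Normalization, Relu, Drop_out, Average_pooling) are not defined in the question. Assuming they are thin wrappers over the TF1 tf.layers API, they would look roughly like the sketch below; the signatures are guessed from the call sites above.

import tensorflow as tf

def Relu(x):
    return tf.nn.relu(x)

def Drop_out(x, rate, training):
    # tf.layers.dropout only zeroes activations when training is True;
    # at evaluation time the layer is a no-op.
    return tf.layers.dropout(inputs=x, rate=rate, training=training)

def Average_pooling(x, pool_size=[2, 2], stride=2, padding='VALID'):
    return tf.layers.average_pooling2d(inputs=x, pool_size=pool_size,
                                       strides=stride, padding=padding)

def Batch_Normalization(x, training, scope):
    with tf.variable_scope(scope):
        return tf.layers.batch_normalization(inputs=x, training=training)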

Answer

"Without any regularization, both validation and training loss (MSE) decrease. The training loss drops faster though, resulting in some overfitting of the final model."

This is not overfitting.

Overfitting starts when your validation loss starts increasing, while your training loss continues decreasing; here is its telltale signature:

[Figure: the classic overfitting curve - training error keeps decreasing while validation error reaches a minimum and then rises.] The image is adapted from the Wikipedia entry on overfitting - different things may lie on the horizontal axis, e.g. depth or number of boosted trees, number of neural net fitting iterations etc.

The (generally expected) difference between training and validation loss is something completely different, called the generalization gap:

"An important concept for understanding generalization is the generalization gap, i.e., the difference between a model’s performance on training data and its performance on unseen data drawn from the same distribution."

where, practically speaking, validation data is indeed unseen data.
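To make that criterion concrete, here is a small framework-agnostic helper (not part of the original answer) that scans logged loss curves and reports the epoch at which the telltale signature appears:

def overfitting_onset(train_loss, val_loss, patience=3):
    # Returns the epoch index after which validation loss rises for
    # `patience` consecutive epochs while training loss keeps falling,
    # i.e. the point where overfitting sets in; returns None otherwise.
    best_val = float('inf')
    rising = 0
    for epoch, (tr, va) in enumerate(zip(train_loss, val_loss)):
        if va < best_val:
            best_val, rising = va, 0
        else:
            rising += 1
            if rising >= patience and tr < train_loss[epoch - rising]:
                return epoch - rising
    return None

For example, overfitting_onset([0.5, 0.4, 0.3, 0.25, 0.2], [0.5, 0.45, 0.44, 0.46, 0.48], patience=2) returns 2, the epoch where validation loss bottomed out while training loss kept falling.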

"So for me it seems like dropout is not really working."

It can very well be the case - dropout is not expected to work always and for every problem.
