How does inverted dropout compensate for the effect of dropout and keep expected values unchanged?

Views: 129 Published: 2020/5/4 10:12:42 Tags: machine-learning neural-network deep-learning regularized dropout


Problem description

I'm learning about regularization in neural networks from the deeplearning.ai course. On dropout regularization, the professor says that when dropout is applied, the computed activation values will be smaller than when it is not applied (at test time). So we need to scale the activations to keep the test phase simpler.
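
To make the professor's claim concrete, here is a short derivation sketch. The symbols are assumptions introduced for illustration and do not come from the course: $a$ is a single activation, $p$ is keep_prob, and $d$ is a mask entry that equals 1 with probability $p$ and 0 otherwise.

```latex
\mathbb{E}[d \cdot a] = p \cdot a + (1 - p) \cdot 0 = p\,a
```

So with $p = 0.8$, a masked activation is on average only 0.8 times its value without dropout, which is the shrinkage the scaling step is meant to undo.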

I understand this fact, but I don't understand how the scaling is done. Here is a code sample that implements inverted dropout.

import numpy as np

keep_prob = 0.8   # probability of keeping a unit, 0 < keep_prob <= 1
# a3 holds the activations of layer 3 (this code is only for layer 3)
# mask entries are True where the random number is below keep_prob:
# about 80% of units stay, about 20% are dropped
d3 = np.random.rand(a3.shape[0], a3.shape[1]) < keep_prob

a3 = np.multiply(a3, d3)   # zero out the dropped units, keep the rest

# scale a3 up so the expected value of the output is not reduced
# (ensures that the expected value of a3 remains the same) - to solve the scaling problem
a3 = a3 / keep_prob

In the above code, why are the activations divided by 0.8, that is, by the probability of keeping a node in the layer (keep_prob)? A numerical example would help.
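
As one possible numerical illustration of the question above, the sketch below applies the same mask-then-divide steps to a toy activation matrix. The helper name `inverted_dropout`, the constant value 5.0, the matrix size, and the seed are all assumptions chosen for illustration; they are not part of the course code.

```python
import numpy as np

def inverted_dropout(a, keep_prob, rng):
    """Hypothetical helper: return the masked and the masked-and-rescaled activations."""
    mask = rng.random(a.shape) < keep_prob   # True where the unit is kept
    dropped = a * mask                        # dropped units become 0
    return dropped, dropped / keep_prob       # before and after rescaling

rng = np.random.default_rng(0)               # fixed seed, only for repeatability
keep_prob = 0.8
a3 = np.full((1000, 1000), 5.0)              # toy activations: every value is 5.0

dropped, scaled = inverted_dropout(a3, keep_prob, rng)

print("mean without dropout:    ", a3.mean())
print("mean after masking only: ", dropped.mean())   # close to 0.8 * 5.0 = 4.0
print("mean after dividing:     ", scaled.mean())    # close to 5.0
```

In this toy case each surviving activation becomes 5.0 / 0.8 = 6.25, and roughly 80% of the units survive, so the average is about 0.8 * 6.25 = 5.0, the same as without dropout. That is why the division uses keep_prob: it cancels, in expectation, the factor introduced by the mask, so no extra scaling is needed at test time.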
