Why do we want to scale outputs when using dropout?

Question

From the dropout paper:

"The idea is to use a single neural net at test time without dropout. The weights of this network are scaled-down versions of the trained weights. If a unit is retained with probability p during training, the outgoing weights of that unit are multiplied by p at test time as shown in Figure 2. This ensures that for any hidden unit the expected output (under the distribution used to drop units at training time) is the same as the actual output at test time."

Why do we want to preserve the expected output? If we use ReLU activations, linear scaling of weights or activations results in linear scaling of network outputs and does not have any effect on the classification accuracy.

What am I missing?

Answer

To be precise, we want to preserve not the "expected output" but the expected value of the output; that is, we want to make up for the difference between the training phase (when we don't pass the values of some nodes) and the testing phase by preserving the mean (expected) value of the outputs.
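In practice, this expectation is often preserved the other way around: instead of multiplying the weights by p at test time, the retained activations are divided by p during training ("inverted dropout"), so the test-time network needs no rescaling at all. A rough sketch of that convention, assuming a keep probability p_keep (my naming, not from the answer):

```python
import numpy as np

rng = np.random.default_rng(0)

def dropout_train(x, p_keep=0.8):
    """Inverted dropout: zero units with probability 1 - p_keep and rescale
    the survivors by 1 / p_keep, so E[output] equals x."""
    mask = rng.random(x.shape) < p_keep
    return x * mask / p_keep

def dropout_test(x):
    """At test time the layer is simply the identity; no rescaling needed."""
    return x

x = np.ones((1_000_000,))
print(dropout_train(x).mean())   # ~ 1.0, matches the test-time expectation
print(dropout_test(x).mean())    #   1.0
```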

In the case of ReLU activations this scaling indeed leads to linear scaling of the outputs (when they are positive), but why do you think it doesn't affect the final accuracy of a classification model? At the end, we usually apply either a softmax or a sigmoid, which are non-linear and depend on this scaling.
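As a small illustration of that last point (my own example, not from the answer), a softmax layer is not invariant to rescaling its inputs, so the scale of the hidden activations does propagate into the predicted probabilities:

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

logits = np.array([2.0, 1.0, 0.5])
print(softmax(logits))        # approx. [0.63, 0.23, 0.14]
print(softmax(0.5 * logits))  # approx. [0.48, 0.29, 0.23]: scaling is not a no-op
```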
