Different learning rates affect the BatchNorm setting. Why?
Problem Description
I am using a BatchNorm layer. I know the meaning of the use_global_stats setting: it is usually set to false for training and to true for testing/deployment. This is my setting for the testing phase.
layer {
  name: "bnorm1"
  type: "BatchNorm"
  bottom: "conv1"
  top: "bnorm1"
  batch_norm_param {
    use_global_stats: true
  }
}
layer {
  name: "scale1"
  type: "Scale"
  bottom: "bnorm1"
  top: "bnorm1"
  scale_param {
    bias_term: true
    filler {
      value: 1
    }
    bias_filler {
      value: 0.0
    }
  }
}
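For comparison, a minimal sketch of what the corresponding training-phase BatchNorm definition could look like (same layer names assumed; in stock Caffe, use_global_stats may also be omitted entirely, since it defaults to false in the TRAIN phase and true in the TEST phase):

layer {
  name: "bnorm1"
  type: "BatchNorm"
  bottom: "conv1"
  top: "bnorm1"
  batch_norm_param {
    # during training, normalize with the per-mini-batch statistics
    use_global_stats: false
  }
}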
In solver.prototxt, I used the Adam method. I found an interesting problem in my case. If I choose base_lr: 1e-3, I get good performance when I set use_global_stats: false in the testing phase. However, if I choose base_lr: 1e-4, I get good performance when I set use_global_stats: true in the testing phase. This suggests that base_lr affects the BatchNorm setting (even though I used the Adam method). Could you suggest any reason for that? Thanks all.
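For reference, a minimal sketch of the kind of solver.prototxt described above; the net path and the hyperparameters other than base_lr and type are assumptions for illustration, not values given in the question:

net: "train_val.prototxt"   # hypothetical network definition path
type: "Adam"
base_lr: 1e-3               # the question compares 1e-3 against 1e-4
momentum: 0.9
momentum2: 0.999
lr_policy: "fixed"
max_iter: 100000
snapshot_prefix: "snapshots/adam"
solver_mode: GPU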
Recommended Answer
AFAIK, the learning rate does not directly affect the learned parameters of the "BatchNorm" layer. Indeed, Caffe forces lr_mult for all internal parameters of this layer to be zero, regardless of the solver's base_lr or type.
However, you might encounter a case where the adjacent layers converge to different points depending on the base_lr you are using, and this indirectly causes the "BatchNorm" layer to behave differently.
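This is often made explicit in prototxt files by pinning the three internal blobs of the "BatchNorm" layer (running mean, running variance, and the moving-average scale factor); a minimal sketch, equivalent to what Caffe enforces internally:

layer {
  name: "bnorm1"
  type: "BatchNorm"
  bottom: "conv1"
  top: "bnorm1"
  # the three internal blobs hold accumulated statistics, not learned
  # weights, so their learning-rate multipliers are set to zero
  param { lr_mult: 0 }
  param { lr_mult: 0 }
  param { lr_mult: 0 }
}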