Why is the softmax function necessary? Why not simple normalization?

Views: 613 Published: 2020/5/17 19:30:41 Tags: neural-network deep-learning softmax


Problem description

I am not familiar with deep learning, so this might be a beginner question. As I understand it, the softmax function in a multilayer perceptron is responsible for normalizing the outputs and assigning a probability to each class. If so, why don't we just use simple normalization?

Say we have a vector x = (10 3 2 1). Applying softmax, the output is y = (0.9986 0.0009 0.0003 0.0001).
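For reference, the softmax figures can be reproduced with a few lines of plain Python. This is only a minimal sketch: the function name `softmax` and the max-subtraction step are my own choices for illustration (the shift leaves the result unchanged and only guards against overflow), not something taken from a particular library.

```python
import math

def softmax(values):
    """Exponentiate each entry, then divide by the sum of the exponentials."""
    # Assumption for illustration: shift by the maximum before exponentiating.
    # This does not change the result; it only avoids overflow for large inputs.
    shift = max(values)
    exps = [math.exp(v - shift) for v in values]
    total = sum(exps)
    return [e / total for e in exps]

x = [10, 3, 2, 1]
y_softmax = softmax(x)
print([round(p, 4) for p in y_softmax])  # [0.9986, 0.0009, 0.0003, 0.0001]
```

Because of the exponential, the gap between 10 and 3 is amplified, so nearly all of the probability mass lands on the first entry.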

Applying simple normalization (dividing each element by the sum, 16), the output is y = (0.625 0.1875 0.125 0.0625).
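The simple normalization can be checked the same way. Again, this is just a sketch; `simple_normalize` is a hypothetical helper name, and it assumes the entries are non-negative with a non-zero sum, as in the example vector.

```python
def simple_normalize(values):
    """Divide each entry by the sum of all entries."""
    # Assumes non-negative entries and a non-zero sum, as in the example.
    total = sum(values)
    return [v / total for v in values]

x = [10, 3, 2, 1]
y_simple = simple_normalize(x)
print(y_simple)  # [0.625, 0.1875, 0.125, 0.0625]
```

The four entries sum to 1, and the last one works out to 1/16 = 0.0625.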

It seems that simple normalization could also distribute the probabilities. So what is the advantage of using the softmax function on the output layer?
