Calculating cross entropy manually vs using softmax_cross_entropy_with_logits in TensorFlow


Question

I'm running into an issue while trying to build a deep ReLU network for the MNIST dataset in TensorFlow. It works fine when I use the built-in tf.nn.softmax_cross_entropy_with_logits() as my loss, but calculating the cross-entropy term manually doesn't seem to work.

Here is what the network looks like:

import tensorflow as tf

train_subset = 200
num_features = 784   # 28 x 28 MNIST images, flattened
num_labels = 10
num_units = 200

# Input placeholder for a batch of flattened MNIST images
x = tf.placeholder(tf.float32, [None, num_features], name='x-input')

bias1 = tf.Variable(tf.constant(0.1, shape=[num_units]), name="bias1")
bias2 = tf.Variable(tf.constant(0.1, shape=[num_units]), name="bias2")
bias3 = tf.Variable(tf.constant(0.1, shape=[num_units]), name="bias3")
bias_out = tf.Variable(tf.constant(0.1, shape=[num_labels]), name="bias_out")

weights1 = tf.Variable(tf.random_normal([num_features, num_units]), name="weights_layer1")
weights2 = tf.Variable(tf.random_normal([num_units, num_units]), name="weights_layer2")
weights3 = tf.Variable(tf.random_normal([num_units, num_units]), name="weights_layer3")
weights_out = tf.Variable(tf.random_normal([num_units, num_labels]), name="weights_out")

# The deep ReLU network: three hidden layers followed by a linear output layer
h_relu1 = tf.nn.relu(tf.add(tf.matmul(x, weights1), bias1))
h_relu2 = tf.nn.relu(tf.add(tf.matmul(h_relu1, weights2), bias2))
h_relu3 = tf.nn.relu(tf.add(tf.matmul(h_relu2, weights3), bias3))
logits = tf.matmul(h_relu3, weights_out) + bias_out

In other words, this works fine:

# Assume that y_ is fed a batch of output labels for MNIST
y_ = tf.placeholder(tf.float32, [None, num_labels], name='y-input')
cost = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(labels=y_, logits=logits))
optimizer = tf.train.AdamOptimizer(1e-3).minimize(cost)

But not this:

y = tf.nn.softmax(logits)
cost = -tf.reduce_sum(y_ * tf.log(y))
optimizer = tf.train.AdamOptimizer(1e-3).minimize(cost)

The latter runs fine, but the accuracy gets stuck after an initial step, whereas the former, which uses the softmax_cross_entropy_with_logits function, actually does learn something. I've seen the latter setup used for the deep MNIST example, which is why I'm wondering what it is about my setup here that causes the optimization procedure to stall.

Answer

Update:

In the end I was able to solve this by implementing the inside of the softmax_cross_entropy_with_logits() function myself; you can find the code here on my GitHub. It comes in two versions, one for the normal case and one for multi-label problems.
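For reference, a minimal sketch of what a numerically stable manual version can look like (this is not the answerer's GitHub code; the stable_softmax_cross_entropy helper and the max-subtraction/log-sum-exp trick are illustrative assumptions), reusing the logits and y_ tensors defined in the question:

# Hedged sketch: a manually implemented, numerically stable softmax
# cross-entropy, close in spirit to what
# tf.nn.softmax_cross_entropy_with_logits computes internally.
def stable_softmax_cross_entropy(logits, labels):
    # Subtract the per-row maximum so tf.exp never overflows.
    shifted = logits - tf.reduce_max(logits, reduction_indices=[1], keep_dims=True)
    # log(softmax) = shifted - log(sum(exp(shifted)))
    log_softmax = shifted - tf.log(
        tf.reduce_sum(tf.exp(shifted), reduction_indices=[1], keep_dims=True))
    # Per-example cross-entropy, averaged over the batch so the loss
    # scale does not depend on the batch size.
    return tf.reduce_mean(-tf.reduce_sum(labels * log_softmax, reduction_indices=[1]))

cost = stable_softmax_cross_entropy(logits, y_)
optimizer = tf.train.AdamOptimizer(1e-3).minimize(cost)

Compared with -tf.reduce_sum(y_ * tf.log(tf.nn.softmax(logits))), this avoids taking the log of softmax values that can underflow to exactly zero, which is one common reason the manual formulation stalls.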

Previous answer:

Originally from the TensorFlow API:

"(Note that in the source code, we don't use this formulation,

cross_entropy = tf.reduce_mean(-tf.reduce_sum(y_ * tf.log(y), reduction_indices=[1]))

because it is numerically unstable. Instead, we apply tf.nn.softmax_cross_entropy_with_logits on the unnormalized logits (e.g., we call softmax_cross_entropy_with_logits on tf.matmul(x, W) + b), because this more numerically stable function internally computes the softmax activation. In your code, consider using tf.nn.(sparse_)softmax_cross_entropy_with_logits instead.)"

Source: https://www.tensorflow.org/versions/r0.11/tutorials/mnist/beginners/
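As a quick illustration of that recommendation, a minimal sketch of the sparse variant, which takes integer class indices instead of one-hot labels, might look like this (the y_idx placeholder is an assumption, not part of the original question):

# Sparse variant: labels are integer class indices of shape [batch_size],
# so no one-hot encoding, tf.nn.softmax, or manual tf.log is needed.
y_idx = tf.placeholder(tf.int64, [None], name='y-index-input')
cost = tf.reduce_mean(
    tf.nn.sparse_softmax_cross_entropy_with_logits(labels=y_idx, logits=logits))
optimizer = tf.train.AdamOptimizer(1e-3).minimize(cost)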

