我在哪里可以在Keras中调用BatchNormalization函数? [英] Where do I call the BatchNormalization function in Keras?
问题描述
如果我想在Keras中使用BatchNormalization函数,那么我是否只需要在开始时调用一次?
If I want to use the BatchNormalization function in Keras, then do I need to call it once only at the beginning?
我已阅读此文档: http://keras.io/layers/normalization/
我不知道该怎么称呼它.下面是我尝试使用它的代码:
I don't see where I'm supposed to call it. Below is my code attempting to use it:
model = Sequential()
keras.layers.normalization.BatchNormalization(epsilon=1e-06, mode=0, momentum=0.9, weights=None)
model.add(Dense(64, input_dim=14, init='uniform'))
model.add(Activation('tanh'))
model.add(Dropout(0.5))
model.add(Dense(64, init='uniform'))
model.add(Activation('tanh'))
model.add(Dropout(0.5))
model.add(Dense(2, init='uniform'))
model.add(Activation('softmax'))
sgd = SGD(lr=0.1, decay=1e-6, momentum=0.9, nesterov=True)
model.compile(loss='binary_crossentropy', optimizer=sgd)
model.fit(X_train, y_train, nb_epoch=20, batch_size=16, show_accuracy=True, validation_split=0.2, verbose = 2)
我问,因为如果我用第二行(包括批处理规范化)运行代码,并且如果我不使用第二行运行代码,则会得到类似的输出.因此,要么我没有在正确的位置调用该函数,要么我猜它并没有太大的区别.
I ask because if I run the code with the second line including the batch normalization and if I run the code without the second line I get similar outputs. So either I'm not calling the function in the right place, or I guess it doesn't make that much of a difference.
推荐答案
只是稍微详细地回答了这个问题,正如Pavel所说的,批处理规范化只是另一层,因此您可以使用它来创建您的所需的网络架构.
Just to answer this question in a little more detail, and as Pavel said, Batch Normalization is just another layer, so you can use it as such to create your desired network architecture.
一般用例是在网络的线性层和非线性层之间使用BN,因为它可以将激活函数的输入归一化,从而使您位于激活函数的线性部分的中心(例如乙状结肠). 此处
The general use case is to use BN between the linear and non-linear layers in your network, because it normalizes the input to your activation function, so that you're centered in the linear section of the activation function (such as Sigmoid). There's a small discussion of it here
在上述情况下,它可能类似于:
In your case above, this might look like:
# import BatchNormalization
from keras.layers.normalization import BatchNormalization
# instantiate model
model = Sequential()
# we can think of this chunk as the input layer
model.add(Dense(64, input_dim=14, init='uniform'))
model.add(BatchNormalization())
model.add(Activation('tanh'))
model.add(Dropout(0.5))
# we can think of this chunk as the hidden layer
model.add(Dense(64, init='uniform'))
model.add(BatchNormalization())
model.add(Activation('tanh'))
model.add(Dropout(0.5))
# we can think of this chunk as the output layer
model.add(Dense(2, init='uniform'))
model.add(BatchNormalization())
model.add(Activation('softmax'))
# setting up the optimization of our weights
sgd = SGD(lr=0.1, decay=1e-6, momentum=0.9, nesterov=True)
model.compile(loss='binary_crossentropy', optimizer=sgd)
# running the fitting
model.fit(X_train, y_train, nb_epoch=20, batch_size=16, show_accuracy=True, validation_split=0.2, verbose = 2)
希望这可以使事情更清楚.
Hope this clarifies things a bit more.
这篇关于我在哪里可以在Keras中调用BatchNormalization函数?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!