Understanding Keras LSTMs


Problem Description

I am trying to reconcile my understanding of LSTMs, as laid out in this post by Christopher Olah, with their implementation in Keras. I am following the blog written by Jason Brownlee for the Keras tutorial. What I am mainly confused about is:

  1. The reshaping of the data series into [samples, time steps, features]
  2. Stateful LSTMs

Let's concentrate on the above two questions with reference to the code pasted below:

import numpy
from keras.models import Sequential
from keras.layers import Dense, LSTM

# reshape into X=t and Y=t+1
look_back = 3
trainX, trainY = create_dataset(train, look_back)
testX, testY = create_dataset(test, look_back)

# reshape input to be [samples, time steps, features]
trainX = numpy.reshape(trainX, (trainX.shape[0], look_back, 1))
testX = numpy.reshape(testX, (testX.shape[0], look_back, 1))

########################
# The IMPORTANT BIT
########################
# create and fit the LSTM network
batch_size = 1
model = Sequential()
model.add(LSTM(4, batch_input_shape=(batch_size, look_back, 1), stateful=True))
model.add(Dense(1))
model.compile(loss='mean_squared_error', optimizer='adam')
for i in range(100):
    # epochs replaces the deprecated nb_epoch argument
    model.fit(trainX, trainY, epochs=1, batch_size=batch_size, verbose=2, shuffle=False)
    model.reset_states()

Note: create_dataset takes a sequence of length N and returns an array of N - look_back elements, each of which is a sequence of length look_back.
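
For reference, a minimal sketch of such a helper, consistent with that description (the tutorial's actual create_dataset may differ in details such as off-by-one indexing):

import numpy

def create_dataset(dataset, look_back=1):
    """Split a 1-D series into (window, next value) pairs."""
    dataX, dataY = [], []
    for i in range(len(dataset) - look_back):
        dataX.append(dataset[i:i + look_back])   # a look_back-long window
        dataY.append(dataset[i + look_back])     # the value that follows it
    return numpy.array(dataX), numpy.array(dataY)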

As can be seen, trainX is a 3-D array, with Time_steps and Feature being the last two dimensions respectively (3 and 1 in this particular code). With respect to the image below, does this mean that we are considering the many to one case, where the number of pink boxes is 3? Or does it literally mean the chain length is 3 (i.e. only 3 green boxes are considered)?

Does the features argument become relevant when we consider multivariate series, e.g. modelling two financial stocks simultaneously?
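
For illustration, a minimal sketch (with made-up prices) of how two stocks would be packed into the features dimension:

import numpy

look_back = 3
# hypothetical closing prices of two stocks, aligned in time
stock_a = numpy.array([1.0, 1.1, 1.2, 1.3, 1.4, 1.5])
stock_b = numpy.array([2.0, 2.1, 2.2, 2.3, 2.4, 2.5])
series = numpy.stack([stock_a, stock_b], axis=-1)           # (6, 2)
X = numpy.array([series[i:i + look_back]
                 for i in range(len(series) - look_back)])  # (3, 3, 2)
print(X.shape)  # [samples, time steps, features] with features == 2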

Do stateful LSTMs mean that we save the cell memory values between runs of batches? If that is the case, batch_size is one and the memory is reset between the training runs, so what was the point of saying it was stateful? I'm guessing this is related to the fact that the training data is not shuffled, but I'm not sure how.

Any thoughts? Image reference: http://karpathy.github.io/2015/05/21/rnn-effectiveness/

A bit confused about @van's comment about the red and green boxes being equal. So just to confirm, do the following API calls correspond to the unrolled diagrams? Especially noting the second diagram (batch_size was arbitrarily chosen):

For people who have done Udacity's deep learning course and are still confused about the time_step argument, look at the following discussion: https://discussions.udacity.com/t/rnn-lstm-use-implementation/163169

It turns out that model.add(TimeDistributed(Dense(vocab_len))) was what I was looking for. Here is an example: https://github.com/sachinruk/ShakespeareBot
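
A minimal sketch of that pattern (vocab_len and the layer sizes here are assumed, not taken from the repository): a Dense layer wrapped in TimeDistributed is applied at every time step of a return_sequences=True LSTM:

from keras.models import Sequential
from keras.layers import LSTM, Dense, TimeDistributed

vocab_len = 50  # assumed vocabulary size
model = Sequential()
model.add(LSTM(128, input_shape=(None, vocab_len), return_sequences=True))
model.add(TimeDistributed(Dense(vocab_len, activation='softmax')))
# output: (batch_size, time_steps, vocab_len) -> one prediction per step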

I have summarised most of my understanding of LSTMs here: https://www.youtube.com/watch?v=ywinX5wgdEU

Answer

First of all, you chose great tutorials (1, 2) to start.

What time-step means: Time-steps==3 in X.shape (describing the data shape) means there are three pink boxes. Since in Keras each step requires an input, the number of green boxes should usually equal the number of red boxes, unless you hack the structure.

Many to many vs. many to one: In Keras, there is a return_sequences parameter when you initialize LSTM, GRU or SimpleRNN. When return_sequences is False (the default), it is many to one as shown in the picture; its return shape is (batch_size, hidden_unit_length), which represents the last state. When return_sequences is True, it is many to many; its return shape is (batch_size, time_step, hidden_unit_length).
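
A minimal sketch (layer sizes assumed) contrasting the two return shapes:

import numpy
from keras.models import Sequential
from keras.layers import LSTM

x = numpy.random.random((2, 3, 1))  # (batch_size, time_steps, features)

many_to_one = Sequential([LSTM(4, input_shape=(3, 1))])  # return_sequences=False
many_to_many = Sequential([LSTM(4, input_shape=(3, 1), return_sequences=True)])

print(many_to_one.predict(x).shape)   # (2, 4)    -> (batch_size, hidden_unit_length)
print(many_to_many.predict(x).shape)  # (2, 3, 4) -> (batch_size, time_step, hidden_unit_length)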

Does the features argument become relevant: the features argument means "how big is your red box", i.e. what the input dimension is at each step. If you want to predict from, say, 8 kinds of market information, then you can generate your data with feature==8.
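
A sketch of what that looks like on the model side (the window length, layer sizes, and random data are assumptions for illustration):

import numpy
from keras.models import Sequential
from keras.layers import LSTM, Dense

X = numpy.random.random((32, 5, 8))  # 32 samples, 5 time steps, 8 market features
y = numpy.random.random((32, 1))

model = Sequential()
model.add(LSTM(16, input_shape=(5, 8)))  # each red box is 8-dimensional
model.add(Dense(1))
model.compile(loss='mean_squared_error', optimizer='adam')
model.fit(X, y, epochs=1, verbose=0)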

Stateful: You can look up the source code. When the state is initialized, if stateful==True, the state from the last training batch is used as the initial state; otherwise a new state is generated. I haven't turned stateful on yet. However, I disagree that batch_size can only be 1 when stateful==True.
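
A minimal sketch of a stateful setup (shapes assumed): the final state of one batch seeds the next, until reset_states() clears it:

from keras.models import Sequential
from keras.layers import LSTM, Dense

model = Sequential()
# stateful layers require a fixed batch size declared up front
model.add(LSTM(4, batch_input_shape=(2, 3, 1), stateful=True))
model.add(Dense(1))
model.compile(loss='mean_squared_error', optimizer='adam')
# the cell state now persists across consecutive fit/predict calls,
# until it is explicitly cleared:
model.reset_states()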

Currently, you generate your data from data that was collected in advance. Imagine your stock information arriving as a stream: rather than waiting a day to collect a full sequence, you would like to generate input data online while training/predicting with the network. If you have 400 stocks sharing the same network, then you can set batch_size==400.
