Keras LSTM 预测的时间序列被挤压和移位 [英] Keras LSTM predicted timeseries squashed and shifted

查看：28 发布时间：2022/1/11 9:20:02 python machine-learning time-series keras lstm

本文介绍了Keras LSTM 预测的时间序列被挤压和移位的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在尝试在假期中获得一些使用 Keras 的经验，我想我会从教科书的股票数据时间序列预测示例开始.所以我要做的是给定过去 48 小时的平均价格变化(自上一个以来的百分比)，预测未来一小时的平均价格变化是多少.

I'm trying to get some hands on experience with Keras during the holidays, and I thought I'd start out with the textbook example of timeseries prediction on stock data. So what I'm trying to do is given the last 48 hours worth of average price changes (percent since previous), predict what the average price chanege of the coming hour is.

但是，在针对测试集(甚至是训练集)进行验证时，预测序列的幅度相差甚远，有时会转移为始终为正或始终为负，即偏离 0% 的变化，我认为这对于这种事情是正确的.

However, when verifying against the test set (or even the training set) the amplitude of the predicted series is way off, and sometimes is shifted to be either always positive or always negative, i.e., shifted away from the 0% change, which I think would be correct for this kind of thing.

我想出了以下最小示例来说明问题:

I came up with the following minimal example to show the issue:

df = pandas.DataFrame.from_csv('test-data-01.csv', header=0)
df['pct'] = df.value.pct_change(periods=1)

seq_len=48
vals = df.pct.values[1:] # First pct change is NaN, skip it
sequences = []
for i in range(0, len(vals) - seq_len):
    sx = vals[i:i+seq_len].reshape(seq_len, 1)
    sy = vals[i+seq_len]
    sequences.append((sx, sy))

row = -24
trainSeqs = sequences[:row]
testSeqs = sequences[row:]

trainX = np.array([i[0] for i in trainSeqs])
trainy = np.array([i[1] for i in trainSeqs])

model = Sequential()
model.add(LSTM(25, batch_input_shape=(1, seq_len, 1)))
model.add(Dense(1))
model.compile(loss='mse', optimizer='adam')
model.fit(trainX, trainy, epochs=1, batch_size=1, verbose=1, shuffle=True)

pred = []
for s in trainSeqs:
    pred.append(model.predict(s[0].reshape(1, seq_len, 1)))
pred = np.array(pred).flatten()

plot(pred)
plot([i[1] for i in trainSeqs])
axis([2500, 2550,-0.03, 0.03])

如您所见，我创建了训练和测试序列，方法是选择最后 48 小时，下一步进入一个元组，然后前进 1 小时，重复该过程.该模型是一个非常简单的 1 个 LSTM 和 1 个密集层.

As you can see, I create training and testing sequences, by selecting the last 48 hours, and the next step into a tuple, and then advancing 1 hour, repeating the procedure. The model is a very simple 1 LSTM and 1 dense layer.

我本来希望单个预测点的图与训练序列图很好地重叠(毕竟这是他们训练的同一组)，并且与测试序列匹配.但是，我在 训练数据 上得到以下结果:

I would have expected the plot of individual predicted points to overlap pretty nicely the plot of training sequences (after all this is the same set they were trained on), and sort of match for the test sequences. However I get the following result on training data:

橙色:真实数据
蓝色:预测数据

知道会发生什么吗?我是不是误会了什么?

Any idea what might be going on? Did I misunderstand something?

更新:为了更好地展示我所说的移位和压扁的意思，我还绘制了预测值，方法是将其移回以匹配真实数据并相乘以匹配幅度.

Update: to better show what I mean by shifted and squashed I also plotted the predicted values by shifting it back to match the real data and multiplied to match the amplitude.

plot(pred*12-0.03)
plot([i[1] for i in trainSeqs])
axis([2500, 2550,-0.03, 0.03])

正如您所见，预测与真实数据非常吻合，只是以某种方式被挤压和偏移，我不知道为什么.

As you can see the prediction nicely fits the real data, it's just squashed and offset somehow, and I can't figure out why.

Keras LSTM 预测的时间序列被挤压和移位 [英] Keras LSTM predicted timeseries squashed and shifted

问题描述

推荐答案

相关文章

AI人工智能最新文章

热门教程

热门工具

登录关闭

Keras LSTM 预测的时间序列被挤压和移位 [英] Keras LSTM predicted timeseries squashed and shifted

问题描述

推荐答案

相关文章

AI人工智能最新文章

热门教程

热门工具

登录 关闭

登录关闭