Implementing RNN with numpy


Question

I'm trying to implement the recurrent neural network with numpy.

My current input and output designs are as follows:

x is of shape: (sequence length, batch size, input dimension)

h: (number of layers, number of directions, batch size, hidden size)

initial weight: (number of directions, 2 * hidden size, input size + hidden size)

weight: (number of layers - 1, number of directions, hidden size, directions * hidden size + hidden size)

bias: (number of layers, number of directions, hidden size)

I have looked up the PyTorch RNN API as a reference (https://pytorch.org/docs/stable/nn.html?highlight=rnn#torch.nn.RNN), but have slightly changed it to include the initial weight as an input (the output shapes are supposedly the same as in PyTorch).
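
For comparison, this is roughly how the referenced torch.nn.RNN is called and what shapes it returns (shown only as a shape reference; the sizes here are made up):

import torch

# shape reference for torch.nn.RNN: 5 time steps, batch of 4, input size 3, hidden size 6, 2 layers
ref = torch.nn.RNN(input_size=3, hidden_size=6, num_layers=2, nonlinearity='tanh')
x = torch.randn(5, 4, 3)        # (seq_len, batch, input_size)
h0 = torch.zeros(2, 4, 6)       # (num_layers * num_directions, batch, hidden_size)
out, hn = ref(x, h0)
print(out.shape)                # torch.Size([5, 4, 6])  i.e. (seq_len, batch, num_directions * hidden_size)
print(hn.shape)                 # torch.Size([2, 4, 6])  i.e. (num_layers * num_directions, batch, hidden_size)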

While it is running, I cannot determine whether it is behaving right, as I am inputting randomly generated numbers as input.

In particular, I am not so certain whether my input shapes are designed correctly.

Could any expert give me some guidance?

import numpy as np

def rnn(xs, h, w0, w=None, b=None, num_layers=2, nonlinearity='tanh', dropout=0.0, bidirectional=False, training=True):
    # xs: (sequence length, batch size, input size)
    # h:  (number of layers, number of directions, batch size, hidden size)
    # w0: (number of directions, 2 * hidden size, input size + hidden size)  -- first-layer weights
    # w:  (number of layers - 1, number of directions, hidden size, directions * hidden size + hidden size)
    # b:  (number of layers, number of directions, hidden size)
    num_directions = 2 if bidirectional else 1
    batch_size = xs.shape[1]
    input_size = xs.shape[2]
    hidden_size = h.shape[3]
    hn = []
    y = [None] * len(xs)

    for l in range(num_layers):
        for d in range(num_directions):
            if l == 0 and d == 0:
                # first layer: split w0 into input-to-hidden and hidden-to-hidden parts
                wi = w0[d, :hidden_size, :input_size].T
                wh = w0[d, hidden_size:, input_size:].T
                wi = np.reshape(wi, (1,) + wi.shape)
                wh = np.reshape(wh, (1,) + wh.shape)
            else:
                # deeper layers take the previous layer's output as input
                wi = w[max(l - 1, 0), d, :, :hidden_size].T
                wh = w[max(l - 1, 0), d, :, hidden_size:].T
            for i, x in enumerate(xs):
                if l == 0 and d == 0:
                    ht = np.tanh(np.dot(x, wi) + np.dot(h[l, d], wh) + b[l, d][np.newaxis])
                    ht = np.reshape(ht, (batch_size, hidden_size))  # otherwise, shape is (bs, 1, hs)
                else:
                    ht = np.tanh(np.dot(y[i], wi) + np.dot(h[l, d], wh) + b[l, d][np.newaxis])
                y[i] = ht
            hn.append(ht)  # last hidden state of this layer/direction
    y = np.asarray(y)
    y = np.reshape(y, y.shape + (1,))
    return np.asarray(y), np.asarray(hn)
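
For what it's worth, a minimal call to the function above with the shapes described earlier might look like this (random data, arbitrary sizes chosen for illustration; assumes the rnn function defined above):

import numpy as np

seq_len, batch_size, input_size, hidden_size = 5, 4, 3, 6
num_layers, num_directions = 2, 1

xs = np.random.randn(seq_len, batch_size, input_size)
h0 = np.zeros((num_layers, num_directions, batch_size, hidden_size))
w0 = np.random.randn(num_directions, 2 * hidden_size, input_size + hidden_size)
w = np.random.randn(num_layers - 1, num_directions, hidden_size,
                    num_directions * hidden_size + hidden_size)
b = np.zeros((num_layers, num_directions, hidden_size))

y, hn = rnn(xs, h0, w0, w=w, b=b, num_layers=num_layers)
print(y.shape, hn.shape)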

Answer

Regarding the shape, it probably makes sense if that's how PyTorch does it, but the Tensorflow way is a bit more intuitive - (batch_size, seq_length, input_size) - batch_size sequences of length seq_length where each element has size input_size. Both approaches can work, so I guess it's a matter of preference.
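
For example, switching between the two layouts is just a transpose:

import numpy as np

x_time_major = np.random.randn(5, 4, 3)          # (seq_len, batch_size, input_size), PyTorch-style
x_batch_major = x_time_major.transpose(1, 0, 2)  # (batch_size, seq_len, input_size), Tensorflow-style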

To see whether your rnn is behaving appropriately, I'd just print the hidden state at each time step, run it on some small random data (e.g. 5 vectors, 3 elements each) and compare the results with your manual calculations.
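
Something along these lines, for example (a rough sketch using your rnn with a single layer and direction; the manual loop is just the plain h_t = tanh(x_t @ W_ih.T + h_{t-1} @ W_hh.T + b) recurrence):

import numpy as np

np.random.seed(0)
seq_len, batch_size, input_size, hidden_size = 5, 1, 3, 4

xs = np.random.randn(seq_len, batch_size, input_size)              # 5 vectors, 3 elements each
h0 = np.zeros((1, 1, batch_size, hidden_size))
w0 = np.random.randn(1, 2 * hidden_size, input_size + hidden_size)
b = np.zeros((1, 1, hidden_size))

# manual recurrence: h_t = tanh(x_t @ Wih.T + h_{t-1} @ Whh.T + b)
Wih = w0[0, :hidden_size, :input_size]   # (hidden_size, input_size)
Whh = w0[0, hidden_size:, input_size:]   # (hidden_size, hidden_size)
ht = h0[0, 0]
manual = []
for x_t in xs:
    ht = np.tanh(x_t @ Wih.T + ht @ Whh.T + b[0, 0])
    print(ht)                            # hidden state at each time step
    manual.append(ht)

y, hn = rnn(xs, h0, w0, w=None, b=b, num_layers=1)
print(np.allclose(np.asarray(manual), y[:, :, :, 0]))  # True only if the implementation matches the recurrence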

Looking at your code, I'm unsure if it does what it's supposed to, but instead of doing this on your own based on an existing API, I'd recommend you read and try to replicate this awesome tutorial from wildml (in part 2 there's a pure numpy implementation).
