尝试对具有屏蔽输入的LSTM Seq2Seq进行推理时，CUDNN_STATUS_BAD_PARAM [英] CUDNN_STATUS_BAD_PARAM when trying to perform inference on a LSTM Seq2Seq with masked inputs

查看：206 发布时间：2021/5/30 21:30:30 python tensorflow lstm tensorflow2.0 tf.keras

本文介绍了尝试对具有屏蔽输入的LSTM Seq2Seq进行推理时，CUDNN_STATUS_BAD_PARAM的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在使用 tensorflow 2.0 上的keras图层来构建一个简单的基于LSTM的Seq2Seq模型以生成文本.

版本我正在使用:Python 3.6.9，Tensorflow 2.0.0，CUDA 10.0，CUDNN 7.6.1，Nvidia驱动程序版本410.78.

我知道TF需要的

这些是推断模型:

  encoder_model = tf.keras.models.Model(编码器输入，编码器状态)coder_state_input_h = tf.keras.layers.Input(形状=(input_dimension，))encoder_state_input_c = tf.keras.layers.Input(shape =(input_dimension，))decoder_states_inputs = [decoder_state_input_h,decoder_state_input_c]解码器输出，状态_h，状态c =解码器_lstm_layer(coder_embedding_layer，initial_state = decoder_states_inputs)coder_states = [state_h，state_c]coder_outputs = output_layer(解码器输出)coder_model = tf.keras.models.Model([解码器输入] +解码器状态输入[解码器输出] +解码器状态)

当我在 encoder_model 上调用 predict()时，我得到CUDNN_STATUS_BAD_PARAM

推理代码(触发错误的地方)

 #使用右填充的输入序列构建初始状态#### CUDNN_STATUS_BAD_PARAM已在此行触发！！！########<<<<<<<<状态= encoder_model.predict(masked_input_sequence)empty_target_sequence = np.zeros((1,1))#这标志着序列的开始empty_target_sequence [0,0] = titles_word_index [sos_token]解码器输出，h，c =解码器模型.预测([empty_target_sequence] +状态)

我尝试过的事情

显式创建掩码( encoder_embedding_layer.compute_mask())，并在每次调用LSTM层时将其作为参数添加，例如:

  encoder_embedding = encoder_embedding_layer(encoder_inputs)encoder_mask = encoder_embedding_layer.compute_mask(encoder_inputs)_，state_h，state_c = tf.keras.layers.LSTM(EMBEDDING_DIMS，return_state = True)(编码器嵌入，掩码=编码器掩码)

不对嵌入层使用初始化程序，以查看问题是否存在

PS: 强制在CPU上进行训练会使错误消失，但我需要在GPU上进行训练，否则需要一段时间才能完成.

PS:这似乎是我遇到的错误:屏蔽LSTM:OP_REQUIRES在cudnn_rnn_ops.cc:1498上失败:未知:CUDNN_STATUS_BAD_PARAM

PS::当我在 model ， encoder_model 和 decoder_model supports_masking 时>，由于某种原因，它们都返回 False .

PS:就像我说的那样，培训没有任何(明显的)错误，但是如果我在命令行上查看Jupyter输出日志，则可以在执行过程中看到以下警告消息培训:

  2019-11-16 19:48:20.144265:Wtensorflow/core/grappler/optimizers/implementation_selector.cc:310]加载函数库时由于错误而跳过了优化:无效的参数:函数'__inference___backward_cudnn_lstm_with_fallback_47598_49057'和'__inference___backward_cudnn_lstm_with_fallback_47598_49057_specialized_for_StatefulPartitionedCall_1_at___inference_distributed_function_52868'两者都实现了"lstm_d41d5ccb-14be-4a74-b5e8-cc4f63c5bb02"，但它们的签名不匹配.

解决方案

您应使用cudnn7.4引用此criteria needed by TF to delegate to CUDNNLstm when a GPU is present (I do have a GPU and my model/data fill all these criteria).

Training goes smoothly (with a warning message, see the end of this post) and I can verify that CUDNNLstm is being used.

However, when I try to call encoder_model.predict(input_sequence) at inference time, I get the following error message:

UnknownError:  [_Derived_]  CUDNN_STATUS_BAD_PARAM
in tensorflow/stream_executor/cuda/cuda_dnn.cc(1424): 'cudnnSetRNNDataDescriptor( data_desc.get(), data_type, layout, max_seq_length, batch_size, data_size, seq_lengths_array, (void*)&padding_fill)'
     [[{{node cond/then/_0/CudnnRNNV3}}]]
     [[lstm/StatefulPartitionedCall]] [Op:__inference_keras_scratch_graph_91878]

Function call stack:
keras_scratch_graph -> keras_scratch_graph -> keras_scratch_graph

Here is the training code: (both source_sequences and target_sequences are right-padded sequences and the embedding matrices are pretrained Glove embeddings)

# Define an input sequence and process it.
encoder_inputs = tf.keras.layers.Input(shape=(24,))
encoder_embedding_layer = tf.keras.layers.Embedding(
  VOCABULARY_SIZE_1,
  EMBEDDING_DIMS,
  embeddings_initializer=initializers.Constant(encoder_embedding_matrix),
  mask_zero=True)
encoder_embedding = encoder_embedding_layer(encoder_inputs)

_, state_h, state_c = tf.keras.layers.LSTM(
  EMBEDDING_DIMS,
  implementation=1,
  return_state=True)(encoder_embedding)

encoder_states = [state_h, state_c]

decoder_inputs = tf.keras.layers.Input(shape=(24,))
decoder_embedding_layer = tf.keras.layers.Embedding(
  VOCABULARY_SIZE_2,
  EMBEDDING_DIMS,
  embeddings_initializer=initializers.Constant(decoder_embedding_matrix),
  mask_zero=True)
decoder_embedding = decoder_embedding_layer(decoder_inputs)

decoder_lstm = tf.keras.layers.LSTM(
    EMBEDDING_DIMS, 
    return_sequences=True, 
    return_state=True,
    implementation=1)

decoder_outputs, _, _ = decoder_lstm(decoder_embedding, initial_state=encoder_states)

decoder_dense = tf.keras.layers.Dense(VOCABULARY_SIZE_TITLE, activation='softmax')

output = decoder_dense(decoder_outputs)

model = tf.keras.models.Model([encoder_inputs, decoder_inputs], output)

model.compile(optimizer='rmsprop', loss='sparse_categorical_crossentropy')
model.summary()

model.fit([source_sequences, target_sequences], decoder_target_data,
    batch_size=32,
    epochs=10,
    validation_split=0.0,
    verbose=2)

These are the inference models:

encoder_model = tf.keras.models.Model(encoder_inputs, encoder_states)

decoder_state_input_h = tf.keras.layers.Input(shape=(input_dimension ,))
decoder_state_input_c = tf.keras.layers.Input(shape=(input_dimension ,))

decoder_states_inputs = [decoder_state_input_h, decoder_state_input_c]

decoder_outputs, state_h, state_c = decoder_lstm_layer(
        decoder_embedding_layer , initial_state=decoder_states_inputs)

decoder_states = [state_h, state_c]

decoder_outputs = output_layer(decoder_outputs)
decoder_model = tf.keras.models.Model(
        [decoder_inputs] + decoder_states_inputs,
        [decoder_outputs] + decoder_states)

When I call predict() on the encoder_model, I get CUDNN_STATUS_BAD_PARAM

Inference code (where error gets triggered)

# build the initial state with a right-padded input sequence
#### CUDNN_STATUS_BAD_PARAM is TRIGGERED ON THIS LINE!!! ######## <<<<<<<<<
state = encoder_model.predict(masked_input_sequence)

empty_target_sequence = np.zeros((1,1))
# this signals the Start of sequence
empty_target_sequence[0,0] = titles_word_index[sos_token]

decoder_outputs, h, c = decoder_model.predict([empty_target_sequence] + state)

Things I have tried

create masks explicitly (encoder_embedding_layer.compute_mask()) and add them as parameters every time I call an LSTM layer, for example:

encoder_embedding = encoder_embedding_layer(encoder_inputs)

encoder_mask = encoder_embedding_layer.compute_mask(encoder_inputs)

_, state_h, state_c = tf.keras.layers.LSTM(
  EMBEDDING_DIMS,
  return_state=True)(encoder_embedding,mask=encoder_mask)

not use initializers for the embedding layers to see if the problem was there

P.S.: forcing the training to take place on a CPU makes the error go away but I need to train it on GPU otherwise it would take ages to complete.

P.S.: This seems to be the very same error I have: Masking LSTM: OP_REQUIRES failed at cudnn_rnn_ops.cc:1498 : Unknown: CUDNN_STATUS_BAD_PARAM

P.S.: when I call method supports_masking on model,encoder_model and decoder_model, all of them return False for some reason.

P.S.: Like I said, training is done with no (apparent) errors but if I look at the Jupyter output log on the command line, I can see the following warning message during training:

2019-11-16 19:48:20.144265: W 
tensorflow/core/grappler/optimizers/implementation_selector.cc:310] Skipping optimization due to error while loading function libraries: 
Invalid argument: Functions '__inference___backward_cudnn_lstm_with_fallback_47598_49057' and 
'__inference___backward_cudnn_lstm_with_fallback_47598_49057_specialized_for_StatefulPartitionedCall_1_at___inference_distributed_function_52868'
 both implement 'lstm_d41d5ccb-14be-4a74-b5e8-cc4f63c5bb02' but their signatures do not match.

解决方案

You should use cudnn7.4 referring to this web

这篇关于尝试对具有屏蔽输入的LSTM Seq2Seq进行推理时，CUDNN_STATUS_BAD_PARAM的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！

查看全文

尝试对具有屏蔽输入的LSTM Seq2Seq进行推理时，CUDNN_STATUS_BAD_PARAM [英] CUDNN_STATUS_BAD_PARAM when trying to perform inference on a LSTM Seq2Seq with masked inputs

问题描述

相关文章

Python最新文章

热门教程

热门工具

登录关闭

尝试对具有屏蔽输入的LSTM Seq2Seq进行推理时，CUDNN_STATUS_BAD_PARAM [英] CUDNN_STATUS_BAD_PARAM when trying to perform inference on a LSTM Seq2Seq with masked inputs

问题描述

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭