
Dropout for LSTM recurrent weights in tensorflow

TensorFlow's DropoutWrapper lets you apply dropout to the cell's inputs, outputs, or state. However, I haven't seen an option to do the same for the cell's recurrent weights (4 of the 8 weight matrices in the original LSTM formulation). I just wanted to check that this is the case before implementing a wrapper of my own.
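For reference, a minimal TF 1.x sketch of what DropoutWrapper does cover (the keep-prob values are placeholders): it masks the tensors flowing into, out of, and between time steps of the cell, but not the recurrent weight matrices themselves.

```python
import tensorflow as tf  # TF 1.x API

lstm = tf.nn.rnn_cell.LSTMCell(num_units=128)

# DropoutWrapper masks the cell's input, output and state tensors;
# the four hidden-to-hidden weight matrices are left untouched.
dropped = tf.nn.rnn_cell.DropoutWrapper(
    lstm,
    input_keep_prob=0.8,   # dropout on the cell's inputs
    output_keep_prob=0.8,  # dropout on the cell's outputs
    state_keep_prob=0.8)   # dropout on the state passed between steps

inputs = tf.placeholder(tf.float32, [None, 20, 64])  # [batch, time, features]
outputs, state = tf.nn.dynamic_rnn(dropped, inputs, dtype=tf.float32)
```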

EDIT:

Apparently this functionality has been added in newer versions (my original comment referred to v1.4): https://github.com/tensorflow/tensorflow/issues/13103
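Relatedly, the TF 1.x DropoutWrapper also takes a variational_recurrent flag, which samples one dropout mask per sequence and reuses it at every time step rather than resampling per step; a minimal sketch with placeholder values (not necessarily what the linked issue added):

```python
import tensorflow as tf  # TF 1.x API

lstm = tf.nn.rnn_cell.LSTMCell(num_units=128)

# With variational_recurrent=True the same dropout mask is reused at
# every time step; dtype is required when this flag is set.
dropped = tf.nn.rnn_cell.DropoutWrapper(
    lstm,
    output_keep_prob=0.8,
    state_keep_prob=0.8,
    variational_recurrent=True,
    dtype=tf.float32)

inputs = tf.placeholder(tf.float32, [None, 20, 64])  # [batch, time, features]
outputs, state = tf.nn.dynamic_rnn(dropped, inputs, dtype=tf.float32)
```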

That's because the original LSTM-with-dropout model applies dropout only to the input and output connections (that is, only to the non-recurrent connections). This paper is considered the "textbook" description of LSTM with dropout: https://arxiv.org/pdf/1409.2329.pdf

More recently, people have tried applying dropout to the recurrent connections as well. If you want to see the implementation and the math behind it, look up "A Theoretically Grounded Application of Dropout in Recurrent Neural Networks" by Yarin Gal. I'm not sure whether TensorFlow or Keras has implemented this approach yet, though.
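For what it's worth, the tf.keras LSTM layer does expose a recurrent_dropout argument that drops units on the hidden-to-hidden connections (in the spirit of Gal's scheme), alongside dropout for the input connections; a minimal sketch, assuming tf.keras:

```python
import tensorflow as tf

# dropout masks the input connections, recurrent_dropout masks the
# recurrent (hidden-to-hidden) connections of the LSTM.
model = tf.keras.Sequential([
    tf.keras.layers.LSTM(128, dropout=0.2, recurrent_dropout=0.2,
                         input_shape=(20, 64)),
    tf.keras.layers.Dense(10, activation="softmax"),
])
model.compile(optimizer="adam", loss="categorical_crossentropy")
```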
