简体繁体中英

Training RNN on GPU - which tf.keras layer should I use?

原文 2019-08-05 14:27:12 5 1 python/ tensorflow/ keras/ tf.keras

I am training RNNs, which I built using tf.keras.layers.GRU layers. They are taking a long time to train (>2 hours), so I am going to deploy them to the GPU for training. I am wondering a few things about training on GPU:

What is the difference between tf.keras.layers.CuDNNGRU and tf.keras.layers.GRU (and also tf.keras.layers.LSTM vs. tf.keras.layers.CuDNNLSTM )? I understand from this post that CuDNNGRU layers train faster than GRU layers, but
- Do the 2 layers converge to different results with the same seed?
- Do the 2 layers perform the same during inference?
- Do CuDNN layers require a GPU during inference?
- Can GRU layers run inference on a GPU?
- Are CuDNN layers easily deployable? I am currently using coremlconverter to convert my keras model to CoreML for deployment.
Is there an equivalent CuDNN layer for tf.keras.layers.SimpleRNN (ie tf.keras.layers.CuDNNSimpleRNN )? I am not committed to a specific architecture yet, and so I believe I would need the tf.keras.layers.CuDNNSimpleRNN layer if I decide on SimpleRNNs and the CuDNN layer has some functionality that I need.
With CuDNN layers, do I need to have tensorflow-gpu installed? Or do they still get deployed to the GPU as long as I have the relevant drivers installed?

1 answers

if you are using a cuda compatible gpu, it makes absolutely sense to use CuDNN layers. They have a different implementation that tries to overcome computation parallelization issues inherent in the RNN architecture. They usually perform a bit worst though but are 3x-6x faster https://twitter.com/fchollet/status/918170264608817152?lang=en

Do the 2 layers converge to different results with the same seed?

yes

Do the 2 layers perform the same during inference?

You should have a comparable performance but not exactly the same

Do CuDNN layers require a GPU during inference?

Yes but you can convert to a CuDNN compatible GRU/LSTM

Can GRU layers run inference on a GPU?

Yes

With CuDNN layers, do I need to have tensorflow-gpu installed? Or do they still get deployed to the GPU as long as I have the relevant drivers installed?

Yes and you need a cuda compatible gpu

tf.keras input layer only for use during inference

tf.keras (RNN) Layer issues when running model.fit()

How do you apply layer normalization in an RNN using tf.keras?

TF.Keras SparseCategoricalCrossEntropy return nan on GPU

Resume Training tf.keras Tensorboard

tf.keras "All layer names should be unique." but layer names are already changed

Use tf.Keras with Tensorflow optimizer

How can I choose the value of output neurons for the hidden layer of tf.keras model?

How do I know which version of the Keras API is implemented in tf.keras?

tf.keras get computed gradient during training

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question tf.keras input layer only for use during inference tf.keras (RNN) Layer issues when running model.fit() How do you apply layer normalization in an RNN using tf.keras? TF.Keras SparseCategoricalCrossEntropy return nan on GPU Resume Training tf.keras Tensorboard tf.keras "All layer names should be unique." but layer names are already changed Use tf.Keras with Tensorflow optimizer How can I choose the value of output neurons for the hidden layer of tf.keras model? How do I know which version of the Keras API is implemented in tf.keras? tf.keras get computed gradient during training

Related Tags

Training RNN on GPU - which tf.keras layer should I use?

Question

1 answers

solution1 0 ACCPTED 2019-08-06 17:21:10

solution1
0 ACCPTED 2019-08-06 17:21:10