如何在张量流中将rnn的初始状态设置为参数？

Question

When one uses dynamic_rnn , a parameter called initial_state is required. 当使用dynamic_rnn ，需要一个名为initial_state的参数。 An easy solution is 一个简单的解决方案是

initial_state = lstm_cell.zero_state(batch_size, tf.float32)

But I want to set the initial state as a parameter which can be optimized, how should I do? 但是我想将初始状态设置为可以优化的参数，该怎么办？

I can define two trainable_variables called h0 and c0 , which are two vectors. 我可以定义两个名为h0和c0 ，它们是两个向量。 But dynamic_rnn requires two matrixes where the first dimension is batch_size . 但是dynamic_rnn需要两个矩阵，其中第一个维度是batch_size 。 How could I expand the vector h0 to a matrix whose each row is h0 ? 如何将向量h0扩展到每行均为h0的矩阵？

Answer 1

What if you did something like this? 如果您做了这样的事情怎么办？

initial_state_vector = tf.get_variable('initial_state_vector', [1, state_size])
initial_state = tf.tile(initial_state_vector, batch_size)

Then you can supply the initial_state variable to the LSTM and it will be the proper size. 然后，您可以向LSTM提供initial_state变量，它将是适当的大小。

如何在张量流中将rnn的初始状态设置为参数？

问题描述

1 个解决方案

解决方案1
3 已采纳 2016-10-21 17:24:31

如何在张量流中将rnn的初始状态设置为参数？

问题描述

1 个解决方案

解决方案1 3 已采纳 2016-10-21 17:24:31

解决方案1
3 已采纳 2016-10-21 17:24:31