
Output Projection in Seq2Seq model Tensorflow

I'm going through the translation code implemented in TensorFlow using a seq2seq model. I'm following the TensorFlow tutorial about the seq2seq model.

In that tutorial there is a part explaining a concept called output projection, which they have implemented in the seq2seq_model.py code. I understand the code, but I don't understand what this output projection part is doing.

It would be great if someone could explain to me what is going on behind this output projection.

Thank You!!

Internally, a neural network operates on dense vectors of some size, often 256, 512 or 1024 floats (let's say 512 here). But at the end it needs to predict a word from the vocabulary, which is often much larger, e.g., 40,000 words. Output projection is the final linear layer that converts (projects) the internal representation into that larger vocabulary space. So, for example, it can consist of a 512 x 40,000 parameter matrix and a 40,000-dimensional bias vector. The reason it is kept separate in the seq2seq code is that some loss functions (e.g., the sampled softmax loss) need direct access to the final 512-sized vector and the output projection matrix. Hope that helps!
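Here is a minimal sketch of that idea, not the tutorial's exact code. The sizes (512-dim decoder output, 40,000-word vocabulary, batch of 32, 512 sampled classes) and variable names are illustrative assumptions; the weight matrix is stored transposed because tf.nn.sampled_softmax_loss expects weights of shape [num_classes, dim].

```python
import tensorflow as tf

# Illustrative sizes (assumptions): 512-dim decoder state, 40,000-word vocabulary.
hidden_size = 512
vocab_size = 40000

# Output projection parameters: a weight matrix and a bias vector.
# Kept transposed ([vocab, hidden]) so they can be fed to sampled_softmax_loss.
w_t = tf.Variable(tf.random.truncated_normal([vocab_size, hidden_size], stddev=0.1))
b = tf.Variable(tf.zeros([vocab_size]))
output_projection = (tf.transpose(w_t), b)  # ([hidden, vocab], [vocab])

# One 512-dim decoder output vector per example (batch of 32, an assumption).
decoder_output = tf.random.normal([32, hidden_size])

# At decoding time: project to full-vocabulary logits and pick the best word.
logits = tf.matmul(decoder_output, output_projection[0]) + output_projection[1]
predicted_ids = tf.argmax(logits, axis=-1)  # shape [32]

# At training time: sampled softmax consumes the 512-dim vectors and the
# projection parameters directly, which is why the projection is kept
# separate instead of being folded into the RNN cell.
labels = tf.random.uniform([32, 1], maxval=vocab_size, dtype=tf.int64)
loss = tf.nn.sampled_softmax_loss(
    weights=w_t, biases=b, labels=labels, inputs=decoder_output,
    num_sampled=512, num_classes=vocab_size)
```

The point of the split is visible in the last two steps: only decoding needs the full 512 x 40,000 matrix multiply, while training can approximate the softmax over a small sample of classes.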
