简体   繁体   中英

Multi-class for sentence classification with pytorch (Using nn.LSTM)

I have this network, that I took from this tutorial, and I want to have sentences as input (Which is already done) and just a one line tensor as a result.

From the tutorial, this sentence “John's dog likes food”, gets a 1 column tensor returned:

tensor([[-3.0462, -4.0106, -0.6096],
[-4.8205, -0.0286, -3.9045],
[-3.7876, -4.1355, -0.0394],
[-0.0185, -4.7874, -4.6013]])

...and class list:

tag_list[ “name”, “verb”, “noun”]

Each line has the probability of a tag being associated with the word. (The first word has [-3.0462, -4.0106, -0.6096 ] vector where the last element corresponds to the maximum scoring tag, "noun")

The tutorial's dataset looks like this:

training_data = [
    ("The dog ate the apple".split(), ["DET", "NN", "V", "DET", "NN"]),
    ("Everybody read that book".split(), ["NN", "V", "DET", "NN"])
]

And I want mine to be of this format:

training_data = [
    ("Hello world".split(), ["ONE"]),
    ("I am dog".split(), ["TWO"]),
    ("It's Britney glitch".split(), ["THREE"])
]

The parameters are defined as:

class LSTMTagger(nn.Module):
    def __init__(self, embedding_dim, hidden_dim, vocab_size, tagset_size):
        super(LSTMTagger, self).__init__()
        self.hidden_dim = hidden_dim
        self.word_embeddings = nn.Embedding(vocab_size, embedding_dim)
        self.lstm = nn.LSTM(embedding_dim, hidden_dim)
        self.hidden2tag = nn.Linear(hidden_dim, tagset_size)

    def forward(self, sentence):
        embeds      = self.word_embeddings(sentence)
        lstm_out, _ = self.lstm(embeds.view(len(sentence), 1, -1))
        tag_space   = self.hidden2tag(lstm_out.view(len(sentence), -1))
        tag_scores  = F.log_softmax(tag_space, dim=1)
        return tag_scores

As of now, the sizes from input and output are not matching and i get: ValueError: Expected input batch_size (2) to match target batch_size (1).

The criterion function doesn't accept the input due to size missmatch it seems:

loss        = criterion(tag_scores, targets)

I've read the last layer could be defined as nn.Linear in order to squash the outputs but i can't seem to get any results. Tried other loss functions

How can I change it in order for the model to classify the sentence , and not each word, as in the original tutorial?

我通过简单地获取最后一个的隐藏状态解决了这个问题

tag_space   = self.hidden2tag(lstm_out[-1])

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM