简体繁体中英

How does language model evaluation work with unknown words?

原文 2017-10-12 15:03:12 1 1 language-model/ perplexity

So for building language models, less frequent words ranked beyond vocabulary size are replaced as 'UNK'.

My question is, how to evaluate such language models that evaluates probabilities based on 'UNK'? Say we want to evaluate the perplexity of such a language model on a test set, for words unknown to the model, the probability we get is evaluated based on a 'bag' of unknown words.

This seems problematic because if we set the vocabulary size as 1, ie all words are unknown, then the perplexity of this can-do-nothing language model is going to be 1.

1 answers

this file explains the question very well:

https://web.stanford.edu/~jurafsky/slp3/4.pdf

in short, perplexity should only be compared between language models with the same vocabulary.

How does Ulmfit's language model work when applied on a text classification problem?

Creating ARPA language model file with 50,000 words

What does 'theta' mean in a language model?

How to tune a Machine Translation model with huge language model?

How to load spacy language model from local machine?

language model with SRILM

Check perplexity of a Language Model

padding and attention mask does not work as intended in batch input in GPT language model

Access spaCy Masked Language Model

How to relate the language model score of a whole sentence to those of the sentence's constituents

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question How does Ulmfit's language model work when applied on a text classification problem? Creating ARPA language model file with 50,000 words What does 'theta' mean in a language model? How to tune a Machine Translation model with huge language model? How to load spacy language model from local machine? language model with SRILM Check perplexity of a Language Model padding and attention mask does not work as intended in batch input in GPT language model Access spaCy Masked Language Model How to relate the language model score of a whole sentence to those of the sentence's constituents

Related Tags

How does language model evaluation work with unknown words?

Question

1 answers

solution1 0 2017-10-12 20:48:07

solution1
0 2017-10-12 20:48:07