I've been utilizing lots of different corpora for natural language processing, and I've been looking for a corpus that has been annotated with Wordnet Word Senses.
I understand that there probably is not a big corpus with this information, since the corpus needs to be built up manually, but there has to be something to go off of.
Also if there isn't a corpus in existence, is there at least a sense annotated ngram database (with what percentage of the time a word is each of its definitions, or a numerical count of each wordnet definition depending on how common the word sense is)?
Three prominent corpora annotated for WordNet:
you can use senseval2, for java there is a semcor format and (jSemcor API) and also senseval3. these two corpus are used for Word sense disambiguation.
The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.