POS tagging in python

Question

I am trying to find out the parts of speech in a particular sentence. I tried to do it using the code given below

from nltk import word_tokenize
import nltk.data
a=raw_input()
text = word_tokenize(a)
pairs=nltk.pos_tag(text)
print pairs

But it always shows 'Delete' as JJ(adjective) where it is supposed to be Verb. How can I improve the code? Thanks in advance

Answer 1

First you should get a corpus of correctly tagged sentences (as suggested above). Just augmenting some of the corpora in your nltk_data folder may already be useful. To train your own tagger from this, see: http://nltk-trainer.readthedocs.org/en/latest/train_tagger.html

POS tagging in python

Question

1 answers

solution1
0 2015-03-25 14:58:57

POS tagging in python

Question

1 answers

solution1 0 2015-03-25 14:58:57

solution1
0 2015-03-25 14:58:57