Hi, is there an efficient way to tag parts of speech in very large files?
import collections

import nltk
import pandas as pd

df = pd.read_csv("data.csv")                    # hypothetical input file and column name
text = " ".join(df["text"].astype(str))         # word_tokenize expects a string, not a DataFrame
tokens = nltk.word_tokenize(text)
tags = nltk.pos_tag(tokens)
counts = collections.Counter(tag for word, tag in tags)  # Counter, not counter
I am trying to find the most common parts of speech in a file and don't know a better way of doing this.
Typically you need to work around three things: the Python-level for loop, potentially high memory load, and potentially high CPU load.
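For the memory side, a minimal sketch of an incremental approach: stream the file and update a Counter as you go, so you never hold the full token and tag lists at once (the path and line-by-line plain-text layout are assumptions here; tagging per line also ignores sentence boundaries, which is usually acceptable for rough tag counts):

import collections

import nltk

def pos_counts_streaming(path):
    # Tag one line at a time so only one line's tokens are ever in memory.
    counts = collections.Counter()
    with open(path, encoding="utf-8") as f:
        for line in f:
            tokens = nltk.word_tokenize(line)
            if tokens:
                counts.update(tag for _, tag in nltk.pos_tag(tokens))
    return counts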
For the CPU side, here's an example of distributed part-of-speech tagging using Python and execnet.
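A minimal sketch of that idea, assuming execnet is installed and NLTK is importable in the worker interpreters; n_workers and batch_size are illustrative values, not tuned defaults:

import collections

import execnet

def remote_pos_tag(channel):
    # Runs inside each worker interpreter; NLTK must be installed there too.
    import collections
    import nltk
    while True:
        batch = channel.receive()
        if batch is None:                       # sentinel: this worker is done
            break
        tags = nltk.pos_tag(batch)
        counts = collections.Counter(tag for _, tag in tags)
        channel.send(dict(counts))              # ship back per-batch tag counts

def distributed_pos_counts(tokens, n_workers=4, batch_size=10000):
    group = execnet.Group()
    try:
        channels = [group.makegateway().remote_exec(remote_pos_tag)
                    for _ in range(n_workers)]
        # Deal token batches out round-robin, remembering how many each worker got.
        pending = [0] * n_workers
        for i, start in enumerate(range(0, len(tokens), batch_size)):
            w = i % n_workers
            channels[w].send(tokens[start:start + batch_size])
            pending[w] += 1
        for ch in channels:
            ch.send(None)                       # sentinel: no more work
        # Merge the per-batch counts from every worker.
        total = collections.Counter()
        for ch, n in zip(channels, pending):
            for _ in range(n):
                total.update(ch.receive())
        return total
    finally:
        group.terminate()

You would call it as distributed_pos_counts(tokens).most_common(10). The gateways here are local subprocesses, which sidesteps the GIL for the CPU-bound tagging; execnet also supports remote hosts via SSH gateway specs if you want to spread the work across machines.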