简体繁体中英

how can I use entity class of previous token as a feature for NER while using crfsuite

原文 2015-07-02 22:57:03 2 2 python/ named-entity-recognition/ crf/ crf++

I am using python-crfsuite package in python, an implementation of CRFSUITE developed by Naoaki Okazaki( http://www.chokkan.org/software/crfsuite/ )

I want to use the entity class of previous token as a feature, which will help me in identifying multi-word named entities. my training data example:

[(Raheja,B-builder),(vista,I-builder),(is,O),(very,O),(famous,O)]

here if i can use the previous class feature while training.but while predicting we pass the list of features to the tagger object. the problem while testing is that previous class will not be known.

can anyone tell me if this is possible in python-crfsuite at all. I feel that the way we pass features to the tagger object, it is not possible.

2 answers

I believe this is not possible with crfsuite (and python-crfsuite), based on this sentence in the tutorial :

Features conditioned with attributes and label bigrams are not supported.

Class of the previous token is used as a feature by default in CRFSuite. CRFSuite uses two kinds of features:

"state features" - I(current_label=A and f(sequence, current_position)) ;
"transition features" - I(current_label=A and previous_label=B)

Features you define are in fact f functions in (1); state features are generated for all possible values of the label. To use transition features you don't have to do anything, they are generated by default.

What is not implemented in CRFsuite is a third kind of feature: I(current_label=A and previous_label=B and f(sequence, current_position)) ; this is what tutorial means by "Features conditioned with attributes and label bigrams".

How can I install sklearn crfsuite on pyCharm?

How does spacy use word embeddings for Named Entity Recognition (NER)?

How can we use Spacy minibatch and GoldParse to train NER model using BILUO tagging scheme?

custom feature function with (python) crfsuite

Use tag as attibute in crfsuite

How can I feed multiple of 100 annotated files in training Custom NER model using spacy3

Entity extraction using POS and NER in spacy

visualizing NER training data and entity using displacy

How can I get indexes after getting NER results?

how to use ktrain for NER Offline?

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question How can I install sklearn crfsuite on pyCharm? How does spacy use word embeddings for Named Entity Recognition (NER)? How can we use Spacy minibatch and GoldParse to train NER model using BILUO tagging scheme? custom feature function with (python) crfsuite Use tag as attibute in crfsuite How can I feed multiple of 100 annotated files in training Custom NER model using spacy3 Entity extraction using POS and NER in spacy visualizing NER training data and entity using displacy How can I get indexes after getting NER results? how to use ktrain for NER Offline?

Related Tags

how can I use entity class of previous token as a feature for NER while using crfsuite

Question

2 answers

solution1
0 2015-09-08 01:33:38

solution2
0 2016-12-05 13:39:53

how can I use entity class of previous token as a feature for NER while using crfsuite

Question

2 answers

solution1 0 2015-09-08 01:33:38

solution2 0 2016-12-05 13:39:53

solution1
0 2015-09-08 01:33:38

solution2
0 2016-12-05 13:39:53