简体繁体中英

Dealing with missing or unknown features when tagging items using CRF model (CRFSuite)

原文 2015-05-20 10:44:56 3 1 python/ missing-data/ crf/ missing-features

I'm using CRFSuite (the python-crfsuite implementation) to build a named-entity-extractor, similar to the tutorial on http://nbviewer.ipython.org/github/tpeng/python-crfsuite/blob/master/examples/CoNLL%202002.ipynb The training input is a sequence of words, each of which has a number of features.

The problem is that for my specific use-case, I don't always have the features of the entities that I'm trying to recognise. I want the CRF model to recognise the entity based on the features of the surrounding words. However, when I simply input an empty dict {} as a word's features, the named entities are never properly classified as such.

I'm wondering if there is a feature or standard method to handle such cases, where after training a model, one does not always have features for all items.

1 answers

在某些情况下，为缺失的功能（例如“-”或“ +”）分配固定值可能会很有用。

How to prepare training corpus for CRF model using CRFSuite

How to use word embedding as features for CRF (sklearn-crfsuite) model training

sklearn_crfsuite.CRF UnicodeEncodeError

Unknown specifier in URL when using Django-Tagging

Numeric conversion of textual features in crfsuite

Implementing BiLSTM-Attention-CRF Model using Pytorch

Shape error when using CRF for binary segmentation in keras

Dealing with missing value in a column using pandas

Tagging of items in Python Pandas

Tagging items in lists

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question How to prepare training corpus for CRF model using CRFSuite How to use word embedding as features for CRF (sklearn-crfsuite) model training sklearn_crfsuite.CRF UnicodeEncodeError Unknown specifier in URL when using Django-Tagging Numeric conversion of textual features in crfsuite Implementing BiLSTM-Attention-CRF Model using Pytorch Shape error when using CRF for binary segmentation in keras Dealing with missing value in a column using pandas Tagging of items in Python Pandas Tagging items in lists

Related Tags

Dealing with missing or unknown features when tagging items using CRF model (CRFSuite)

Question

1 answers

solution1 0 2015-07-13 14:49:08

solution1
0 2015-07-13 14:49:08