
Using LDA Topic Models as a Classification Model Input

Original 2019-12-05 05:17:40 7 1 python/ lda/ topic-modeling

I built an LDA topic model from a large training data set. Now I want to use this model to classify new sentences that were not part of the training data.

How can I find the closest topic number for a new input sentence?

Should I use LDA topic models as input to a classification model?

Example code in Python is welcome.

1 answer

In a classification problem the ground-truth labels are known, so the only question is how to extract features from the training data. With LDA, the feature is usually the topic probability distribution: if the corpus has 5 topics, the feature vector has 5 dimensions. That should be a better feature than the closest topic number (the single most probable topic), since it keeps the weight of every topic rather than only the largest one.
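As a minimal sketch of this idea, the example below feeds a 5-dimensional topic distribution into a classifier. It assumes scikit-learn, which the answer does not name; the toy corpus, the labels, and the choice of `LogisticRegression` are made up for illustration only.

```python
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Toy labeled corpus (hypothetical data, for illustration only).
train_texts = [
    "the team won the football match",
    "the striker scored a late goal",
    "the coach praised the players after the game",
    "the stock market fell sharply today",
    "investors sold shares as prices dropped",
    "the bank raised interest rates again",
]
train_labels = ["sports", "sports", "sports", "finance", "finance", "finance"]

n_topics = 5  # matches the "5 topics" example in the answer

# Word counts -> LDA topic distribution -> classifier.
clf = make_pipeline(
    CountVectorizer(stop_words="english"),
    LatentDirichletAllocation(n_components=n_topics, random_state=0),
    LogisticRegression(max_iter=1000),
)
clf.fit(train_texts, train_labels)

# The features the classifier actually sees: one row per document,
# one column per topic, each row summing to 1.
topic_features = clf[:-1].transform(train_texts)
print(topic_features.shape)

# Classify a sentence that was not in the training data.
prediction = clf.predict(["the players scored in the match"])
print(prediction[0])
```

On a corpus this small the topics are not meaningful; the point is only the shape of the pipeline, where the classifier is trained on topic distributions instead of raw word counts.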

To get the topic probability distribution for a new input sentence, take a look here; other packages should have similar functions.
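The link target is not preserved here, so the following is only one possible way to do it, again assuming scikit-learn rather than whichever package the answer pointed to. It infers the topic distribution of an unseen sentence with `LatentDirichletAllocation.transform` and takes the most probable topic as the "closest topic number". The corpus and the number of topics are hypothetical.

```python
import numpy as np
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.feature_extraction.text import CountVectorizer

# Toy training corpus (hypothetical data, for illustration only).
train_texts = [
    "the team won the football match",
    "the striker scored a late goal",
    "the coach praised the players after the game",
    "the stock market fell sharply today",
    "investors sold shares as prices dropped",
    "the bank raised interest rates again",
]

# Fit the vocabulary and the LDA model on the training data only.
vectorizer = CountVectorizer(stop_words="english")
X_train = vectorizer.fit_transform(train_texts)
lda = LatentDirichletAllocation(n_components=2, random_state=0)
lda.fit(X_train)

# A new sentence must go through the SAME fitted vectorizer
# (transform, not fit_transform) so its columns match the model.
new_sentence = "investors worry as market prices fell"
X_new = vectorizer.transform([new_sentence])

# Topic probability distribution for the new sentence (sums to 1).
topic_dist = lda.transform(X_new)[0]

# The closest topic is the index with the highest probability.
closest_topic = int(np.argmax(topic_dist))
print(topic_dist, closest_topic)
```

With gensim, the comparable step would be converting the sentence with the training dictionary's `doc2bow` and calling `get_document_topics` on the trained `LdaModel`.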

Related questions

Prepare dataset for the LDA topic models using CountVectorizer

Topic modelling using LDA

Error in visualizing LDA Topic Model

LDA topic modeling input data

Topic Modelling: WordCloud For Every Topic in LDA model

Using LDA(topic model) : the distrubution of each topic over words are similar and “flat”

Gensim LDA model topic diff resulting in nan

How to get topic of new document in LDA model

How to predict the topic of a new query using a trained LDA model using gensim?

Error of using tfidf as the input of method 'model.fit()' in package lda


The technical posts on this site are licensed under CC BY-SA 4.0. If you reprint them, please indicate the site URL or the original address. For any questions, please contact: yoyou2525@163.com.



solution1 0 2020-01-14 02:58:24
