简体   繁体   中英

LDA Topic assignment

I have a corpora of ~1,400 documents. I did all text cleansing using tm package. My last step was creation of the DTM matrix. I am trying to train the LDA model based on 200 documents examined by human and topics(categories) that were assigned. Unfortunately, I can't share the reproducible example.

Can someone help how is this performed with one of the freely available data sets as an example?

If you have annotated training data, why don't you use supervised classification techniques like SVM or logistic regression which are pretty good for text classification tasks. Scikit-learn in python has all the implementation for these classifiers and you can directly use them for the classification purpose.

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM