简体   繁体   中英

Can HDP (Hierarchical Dirichilet Process) detect the number of topics from the data?

I am using HDP (Hierarchical Dirichilet Process) package from gensim topic modelling software. Gensim HDP implementation expects user to provide number of topics (T) in advance.

hdpmodel.HdpModel(self, corpus, id2word,T=150)

The documentation defines T as top level truncation level.

Can HDP determine number of topics on its own? Is there an implementation of HDP which can detect number of topics? Any help is appreciated.

You can try using the stick-breaking prior .

Some people have reported good results. Others said it did not work for them at all.

Guess you will have to implement this yourself and see if it works.

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM