BERTopic Model - Visualization ignores 0th index
The BERTopic model produced the topics below:
As you can see from the above, the model is fine-tuned to generate fewer outliers: the outlier topic '-1' has a count of 3 and appears last.
When visualizing the topics per class,
topic_model.visualize_topics_per_class(topics_per_class)
the interactive plot below is generated, but it ignores the 0th index, to be precise Topic 0. The Global Topic Representations are displayed as 1, 2, 3, 4, 5, 6, -1.
Is BERTopic designed so that it always assumes the very first index is the outlier topic (-1) and eliminates it blindly?
Are the generated topics always accessed in order of count size, perhaps descending?
This issue was also posted on the BERTopic GitHub forum. According to the response from the author himself, setting top_n_topics=None makes all the topics, including the 0th index, visible in the visualization:
topic_model.visualize_topics_per_class(topics_per_class, top_n_topics=None)
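For anyone wondering how Topic 0 could disappear, here is a minimal pure-Python sketch of the behaviour guessed at in the question. It is not BERTopic's actual code: select_topics is a hypothetical helper, the default of 10 is an assumption, and every count except the outlier count of 3 is made up. It only shows that skipping the first row of a count-sorted list reproduces the 1, 2, 3, 4, 5, 6, -1 ordering when -1 is not the largest topic.

```python
# Hypothetical topic frequencies, sorted by count in descending order.
# Only the outlier count (3) comes from the question; the rest are made up.
topic_freq = [(0, 120), (1, 95), (2, 80), (3, 60),
              (4, 41), (5, 30), (6, 12), (-1, 3)]


def select_topics(freq, top_n_topics=10):
    """Illustrative selection logic, NOT BERTopic's actual code.

    Assumes the first row is the outlier topic -1 and skips it
    whenever top_n_topics is set.
    """
    if top_n_topics is None:
        # No limit: return every topic, nothing is skipped.
        return [topic for topic, _ in freq]
    # Skip the first row, then take up to top_n_topics rows.
    return [topic for topic, _ in freq[1:top_n_topics + 1]]


default_selection = select_topics(topic_freq)
full_selection = select_topics(topic_freq, top_n_topics=None)

print(default_selection)  # [1, 2, 3, 4, 5, 6, -1]
print(full_selection)     # [0, 1, 2, 3, 4, 5, 6, -1]
```

Under that assumption, the skipped first row is Topic 0 rather than -1 whenever the outlier topic is small, which would explain why passing top_n_topics=None brings Topic 0 back.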