简体   繁体   English

聚类词

[英]Clustering Words

I have a list of words. 我有一个单词列表。 I am looking for a way to cluster these words together semantically. 我正在寻找一种在语义上将这些单词聚集在一起的方法。 Can anyone tell me about a library or tool that accomplishes this? 谁能告诉我有关完成此操作的库或工具? I have been searching this on net a lot but nothing suits my requirements. 我一直在网上搜索这个,但不符合我的要求。 Most of the words are technical and so won't be found in any dictionary. 大多数单词都是技术性的,因此不会在任何字典中找到。 I want to perform ontological clustering on the list of words. 我想在单词列表上执行本体聚类。

Do you have a collection so you will have a context to work with? 你有一个集合,所以你有一个上下文可以使用吗?

If you ha a given collection you can use you can compute the number of documents each pair of terms appear together and, based on that calculate some semantic similarity between them like [Jaccard] ( http://en.wikipedia.org/wiki/Jaccard_index ) or [Dice] ( http://en.wikipedia.org/wiki/Dice%27s_coefficient ). 如果你有一个给定的集合你可以使用你可以计算每对术语一起出现的文档数量,并根据它计算它们之间的一些语义相似性,如[Jaccard]( http://en.wikipedia.org/wiki/ Jaccard_index )或[Dice]( http://en.wikipedia.org/wiki/Dice%27s_coefficient )。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM