简体   繁体   English

比较图表之间的相似性?

[英]Compare similarity between graphs?

I have multiple Concept Maps that are represented as directed graphs. 我有多个表示为有向图的概念图。 I have used this method , to compare 2 concept maps, but now I'd like to classify / cluster similar graphs together. 我已经使用此方法比较了2个概念图,但是现在我想将相似的图分类/聚类在一起。

AFAIK, the traditional clustering algorithm take input as multi-dimensional data points. 传统的聚类算法AFAIK将输入作为多维数据点。 But I've also read that it is difficult and not recommended to transform a graph into a vector. 但是我也读到它很困难, 不建议将图形转换为向量。

In that case, How do I approach this problem? 在那种情况下,我该如何解决这个问题?

Many (most, except for eg k-means, EM and Mean-shift) clustering algorithms use distances , not points. 许多(大多数除外,例如k均值,EM和均值平移)聚类算法使用距离而不是点。

For small data sets, hierarchical clustering is certainly the first method to try. 对于小型数据集,分层聚类无疑是第一种尝试的方法。 Single-link, complete-link, average-link have little formal requirements, ie they may be used either with a distance or a similarity, which does not need to satisfy the triangle inequality. 单链路,完整的链接,一般链接有什么正规的要求,即它们既可以与距离或相似性,这并不需要满足三角不等式中使用。 Other metrics such as Ward and centroid linkage require squared Euclidean distances and will probably not work here. 其他度量标准(例如Ward和质心链接)需要平方欧几里德距离,因此可能不适用于此处。

  1. compute pairwise graph matching distances 计算成对图匹配距离
  2. check for any normalization (eg graph size) required 检查是否需要任何规范化(例如图形大小)
  3. run hierarchical clustering 运行分层聚类
  4. study the dendrogram, you may need to go back and improve your normalization, distance, etc. 研究树状图,您可能需要返回并改善归一化,距离等。
  5. cut subtree clusters from the dendrogram 从树状图中切出子树簇

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 Python:如何比较使用k-means算法的聚类之间的相似性? - Python: how to compare the similarity between clustering using k-means algorithm? 检查文本数据之间的相似性 - To check similarity between text data 计算Lucene文档和质心之间的相似度 - Calculating similarity between and centroid of Lucene documents 用于主题检测的推文之间的表示和良好的相似性度量 - Representation and a good similarity measure between Tweets for topic detection doc2vec聚类n * n文档之间的相似性 - doc2vec clustering n*n similarity between documents 如何在Python中相似性或重叠方面比较两个群集分组? - How can you compare two cluster groupings in terms of similarity or overlap in Python? 如何计算两个树状图中两个个体之间或两种聚类方法之间的同位相似度? - How to calculate the cophenetic similarity between two individual in two dendograms or between two clustering methods? 群集方案:2个点的calculatedCost之间的差异,用作点之间的相似性度量。 是否适用? - Cluster Scenario: Difference between the computedCost of 2 points used as similarity measure between points. Is it applicable? 可以使用什么机制来量化非数字列表之间的相似性? - What mechanism can be used to quantify similarity between non-numeric lists? 应该给出什么作为链接函数的输入-tfidf矩阵或tfidf矩阵的不同元素之间的相似性? - What should be given as an input to linkage function - tfidf matrix or similarity between different elements of tfidf matrixes?
 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM