简体繁体 English

数据挖掘中有哪些不同的模式评估措施？

[英]What are the different pattern evaluation measures in data mining?

原文 2015-05-27 13:25:47 7 2 algorithm/ data-mining/ market-basket-analysis

在数据挖掘中，使用不同的算法（例如Apriori算法，FP-Tree等）发现频繁项集。那么这些模式评估方法吗？

2 个解决方案

You can try Association Rules (apriori for example), Collaborative Filtering (item-based or user-based) or even Clustering. 您可以尝试关联规则（例如apriori），协作过滤（基于项目或基于用户）甚至是群集。

I don't know what you are trying to do, but if you have a data-set and you need to find the most frequent item-set you should try some of the above techniques. 我不知道您要做什么，但是如果您有数据集并且需要找到最频繁的项目集，则应该尝试上述一些技巧。

If you're using R you should explore the arules package for association rules (for example). 如果您使用的是R，则应浏览arules包以获取关联规则（例如）。

Apriori algorithm and FP-tree algorithm is used to find frequent itemsets for the given transactional data. Apriori算法和FP-tree算法用于查找给定交易数据的频繁项集。 This would help in market basket analysis applications. 这将有助于市场分析应用。 For pattern evaluation, there are many components namely: 对于模式评估，有很多组件，即：

support, 支持，
confidence, 置信度，
Lift, 电梯，
Imbalance ratio, etc. 不平衡率等

More details can be seen at the paper: 可以在本文中看到更多详细信息：

Selecting the right interestingness measure for association patterns by Pang Ning Tan, Vipin Kumar, Jaideep Srivastava, KDD 2002. Pang Ning Tan，Vipin Kumar，Jaideep Srivastava，KDD 2002为关联模式选择正确的兴趣度度量。