简体   繁体   English

数据挖掘中有哪些不同的模式评估措施?

[英]What are the different pattern evaluation measures in data mining?

在数据挖掘中,使用不同的算法(例如Apriori算法,FP-Tree等)发现频繁项集。那么这些模式评估方法吗?

You can try Association Rules (apriori for example), Collaborative Filtering (item-based or user-based) or even Clustering. 您可以尝试关联规则(例如apriori),协作过滤(基于项目或基于用户)甚至是群集。

I don't know what you are trying to do, but if you have a data-set and you need to find the most frequent item-set you should try some of the above techniques. 我不知道您要做什么,但是如果您有数据集并且需要找到最频繁的项目集,则应该尝试上述一些技巧。

If you're using R you should explore the arules package for association rules (for example). 如果您使用的是R,则应浏览arules包以获取关联规则(例如)。

Apriori algorithm and FP-tree algorithm is used to find frequent itemsets for the given transactional data. Apriori算法和FP-tree算法用于查找给定交易数据的频繁项集。 This would help in market basket analysis applications. 这将有助于市场分析应用。 For pattern evaluation, there are many components namely: 对于模式评估,有很多组件,即:

  1. support, 支持,
  2. confidence, 置信度,
  3. Lift, 电梯,
  4. Imbalance ratio, etc. 不平衡率等

More details can be seen at the paper: 可以在本文中看到更多详细信息:

Selecting the right interestingness measure for association patterns by Pang Ning Tan, Vipin Kumar, Jaideep Srivastava, KDD 2002. Pang Ning Tan,Vipin Kumar,Jaideep Srivastava,KDD 2002为关联模式选择正确的兴趣度度量。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM