
Correlations/Data Mining in Microsoft Excel 2003

I have an Excel spreadsheet in which each column is a variable. The last column is a special one called "Type", which can be A, B, C, or D.

Each row is a data point whose variables place it in one "Type" bucket (A/B/C/D), recorded in the last column.

I need a way to examine all entries of a certain type (say, "C", or "C"|"D") and find out which variables are good predictors of this last column, and which are better predictors than others.

Some variables are numbers, others are fixed strings (from a set of strings), so it's not just a number/number correlation.

Is Excel 2003 a good tool for that, or are there better statistical programs that make this easier? Do I create a Pivot/Histogram for each category, or is there a better way to run these queries? Thanks

You can do some filtering in Microsoft Excel, especially to clean the data (that is, to convert the values in each column to a single type, string or numeric). Excel can also do some basic data mining. However, for the kind of problem you have, the tool I would recommend is WEKA. With it you can run associative classification (i.e., class association rule mining) over all data instances (rows) and so determine which items belong to A/B/C/D. Your special "Type" column will be the class attribute.
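If you just want a quick ranking of the columns before setting up a full tool, one common measure is information gain: how much knowing a column's value reduces the uncertainty about "Type". The sketch below is only an illustration of that idea in plain Python, not what WEKA does internally. The sample rows, the column names `size` and `color`, the equal-width binning of numeric columns, and the `focus` parameter (for questions like "C or D versus everything else") are all assumptions made for the example.

```python
import math
from collections import Counter, defaultdict

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    counts = Counter(labels)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())

def bin_numeric(values, bins=3):
    """Map numbers to equal-width bin indices so they act like categories."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0] * len(values)
    width = (hi - lo) / bins
    return [min(int((v - lo) / width), bins - 1) for v in values]

def information_gain(column, labels):
    """Drop in entropy of `labels` after grouping rows by `column` value."""
    groups = defaultdict(list)
    for value, label in zip(column, labels):
        groups[value].append(label)
    remainder = sum(len(g) / len(labels) * entropy(g) for g in groups.values())
    return entropy(labels) - remainder

def rank_predictors(rows, target="Type", focus=None, bins=3):
    """Return (column, gain) pairs, best predictor first.

    If `focus` is a set such as {"C", "D"}, the target is reduced to
    "in focus" versus "not in focus" before scoring.
    """
    if focus:
        labels = [r[target] in focus for r in rows]
    else:
        labels = [r[target] for r in rows]
    scores = {}
    for name in rows[0]:
        if name == target:
            continue
        column = [r[name] for r in rows]
        # Numeric columns are binned; string columns are used as they are.
        if all(isinstance(v, (int, float)) for v in column):
            column = bin_numeric(column, bins)
        scores[name] = information_gain(column, labels)
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)

# Hypothetical data: "color" determines Type, "size" carries no information.
rows = [
    {"size": 1.0, "color": "red",   "Type": "A"},
    {"size": 9.0, "color": "red",   "Type": "A"},
    {"size": 1.0, "color": "blue",  "Type": "B"},
    {"size": 9.0, "color": "blue",  "Type": "B"},
    {"size": 1.0, "color": "green", "Type": "C"},
    {"size": 9.0, "color": "green", "Type": "C"},
    {"size": 1.0, "color": "black", "Type": "D"},
    {"size": 9.0, "color": "black", "Type": "D"},
]

ranking = rank_predictors(rows)
ranking_cd = rank_predictors(rows, focus={"C", "D"})
```

Here `ranking` puts `color` first, and `ranking_cd` answers the narrower "C or D" question. To use it on the real sheet, you could save the spreadsheet as CSV and load the rows with `csv.DictReader`, converting the numeric columns to `float` first. Keep in mind that a column with very many distinct strings will score high on information gain even when it does not generalize, so treat the ranking as a starting point.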
