简体   繁体   English

相关系数

[英]Correlation coefficient

First of all, i'm sorry if this question is so basic. 首先,对不起,这个问题如此基本。 I'm trying just to calculate correlation coefficient from three lines of my dataframe : 我正在尝试仅从数据帧的三行计算相关系数:

df=structure(list(Id = 1:3, V1 = c(27L, 40L, 29L), V2 = c(70L, 
101L, 48L), V3 = c(68L, 84L, 55L), V4 = c(48L, 80L, 39L), V5 = c(58L, 
73L, 38L), V6 = c(80L, 103L, 46L), V7 = c(99L, 115L, 52L), V8 = c(46L, 
82L, 58L), V9 = c(26L, 38L, 33L), V10 = c(13L, 17L, 13L)), .Names = c("Id", 
"V1", "V2", "V3", "V4", "V5", "V6", "V7", "V8", "V9", "V10"), row.names = c(2L, 
5L, 8L), class = "data.frame")

What i'm doing is to convert these lines to vectors numeric 我正在做的是将这些行转换为矢量数字

df=df[-1]

g=as.numeric(df[1,])
h=as.numeric(df[2,])
i=as.numeric(df[3,])

and running correlation 2 per 2: 和每2个运行相关性2:

> cor(g,h)
[1] 0.9530113
> cor(g,i)
[1] 0.7557693
> cor(h,i)
[1] 0.8519315

I made search about this but it seems that there is no such function cor(g,h,i) , instead i Cant run cor(df) but it will gives me correlation between all the V1:V10 . 我对此进行了搜索,但似乎没有这样的函数cor(g,h,i) ,而是我无法运行cor(df)但是它将为我提供所有V1:V10之间的相关性。

In conclusion, is there function that allows me to execute cor(g,h,i) and return to me the three correlation coefficient (0.9530113 , 0.7557693 , 0.8519315) or a more optimised method than mine. 总之,是否存在允许我执行cor(g,h,i)并返回给我三个相关系数(0.9530113 , 0.7557693 , 0.8519315)或比我更优化的方法的(0.9530113 , 0.7557693 , 0.8519315)

# Get the correlation matrix by row
cor(t(df[-1]))
#           2         5         8
# 2 1.0000000 0.9530113 0.7557693
# 5 0.9530113 1.0000000 0.8519315
# 8 0.7557693 0.8519315 1.0000000

# Retrieve the correlation as vector
cor_mat <- cor(t(df[-1]))
cor_mat[upper.tri(cor_mat)]
# [1] 0.9530113 0.7557693 0.8519315

If you want a function: 如果需要功能:

corr <-function(data,g,h,i) {
 m <- cor(data[,c(g,h,i)])
 m[upper.tri(m)]
}

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM