簡體   English   中英

從 r 中的長表創建一個對稱矩陣

[英]Create a matrix symmetric from long table in r

這是我的數據

df <- data.frame (Var1  <- c("a", "b", "c","d","e","b","c","d","e","c","d","e","d","e","e"),
                  Var2 <- c("a","a","a","a","a","b","b","b","b","c","c","c","d","d","e")
                  pre <- c(1,2,3,4,5,1,6,7,8,1,9,10,1,11,1) )

我想用 Var1 和 Var2 function 作為行名和列名構建一個對稱矩陣,矩陣值是 r 中“pre”中的對應數字,如下所示:

  a b c d e 
a 1 2 3 4 5
b 2 1 6 7 8 
c 3 6 1 9 10
d 4 7 9 1 11
e 5 8 10 11 1

這似乎是一個簡單的問題,但是我google了很多帖子,但都沒有解決,所以特地來問一下,謝謝!

夢瑩

您可以先獲取寬格式的數據。

library(dplyr)
library(tidyr)

mat <- df %>%
  pivot_wider(names_from = Var2, values_from = pre, values_fill = 0) %>%
  column_to_rownames('Var1') %>%
  as.matrix()

mat
#  a b  c  d e
#a 1 0  0  0 0
#b 2 1  0  0 0
#c 3 6  1  0 0
#d 4 7  9  1 0
#e 5 8 10 11 1

由於您有一個對稱矩陣,您可以將下三角矩陣復制到上三角。

mat[upper.tri(mat)] <- t(mat)[upper.tri(mat)]
mat

#  a b  c  d  e
#a 1 2  3  4  5
#b 2 1  6  7  8
#c 3 6  1  9 10
#d 4 7  9  1 11
#e 5 8 10 11  1

數據

df <- data.frame (Var1  = c("a", "b", "c","d","e","b","c","d","e","c","d","e","d","e","e"),
                  Var2 = c("a","a","a","a","a","b","b","b","b","c","c","c","d","d","e"),
                  pre = c(1,2,3,4,5,1,6,7,8,1,9,10,1,11,1) )

這是igraph package 的一個選項

g <- graph_from_data_frame(df,directed = FALSE)
E(g)$pre <- df$pre
get.adjacency(g,attr = "pre")

這使

  a b  c  d  e
a 1 2  3  4  5
b 2 1  6  7  8
c 3 6  1  9 10
d 4 7  9  1 11
e 5 8 10 11  1

基礎 R 解決方案(使用 Ronak 提供的數據):

# Crosstab: 
mdat <- as.data.frame.matrix(xtabs(pre ~ Var1 + Var2, df))

# Reflect on the diag (thanks @Ronak Shah):
mdat[upper.tri(mdat)] <- t(mdat)[upper.tri(mdat)]

正如@ThomasIsCoding 指出的那樣,我們也可以使用這個單線:

xtabs(pre ~ ., unique(rbind(df, cbind(setNames(rev(df[-3]), names(df)[-3]), df[3] ))))

正如@thelatemail 指出的那樣,我們還可以:

xtabs(pre ~ ., unique(data.frame(Map(c, df, df[c(2,1,3)]))))

這是一個基本的 R 版本:

df <- data.frame (Var1  = c("a", "b", "c","d","e","b","c","d","e","c","d","e","d","e","e"),
                  Var2 = c("a","a","a","a","a","b","b","b","b","c","c","c","d","d","e"),
                  pre = c(1,2,3,4,5,1,6,7,8,1,9,10,1,11,1))

# Generate new matrix/data frame
mat2 <- matrix(0, length(unique(df$Var1)), length(unique(df$Var2)))

# Name the columns and rows so we can access values
rownames(mat2) <- unique(df$Var1)
colnames(mat2) <- unique(df$Var2)

# Save values into appropriate places into data frame
mat2[as.matrix(df[, 1:2])] <- as.matrix(df[, 3])

# Using upper triangle trick from @Ronak Shah's answer
mat2[upper.tri(mat)] <- t(mat2)[upper.tri(mat2)]

# See results
mat2
#   a b  c  d  e
# a 1 2  3  4  5
# b 2 1  6  7  8
# c 3 6  1  9 10
# d 4 7  9  1 11
# e 5 8 10 11  1

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM