简体   繁体   English

如何在R中添加具有不同列名称的稀疏矩阵?

[英]How to add sparse matrices with different column names in R?

I have a list of sparse matrices with the same number of rows but different columns. 我有一个行数相同但列数不同的稀疏矩阵列表。

Here is a toy dataset: 这是一个玩具数据集:

library(dplyr)
library(Matrix)
ms <- list(
  m1 = data.frame(a = c(1, 10, 100), d = c(2, 20, 200), e = c(3, 30, 300)) %>% as.matrix %>% as("sparseMatrix"),
  m2 = data.frame(a = c(4, 40, 400), e = c(5, 50, 500), f = c(6, 60, 600), g = c(7, 70, 700)) %>% as.matrix%>% as("sparseMatrix"),
  m3 = data.frame(c = c(8, 80, 800), d = c(9, 90, 900)) %>% as.matrix%>% as("sparseMatrix")
)

I want to add every matrix in ms by column. 我想按列添加以ms为单位的每个矩阵。 This is how I'm currently doing it: 这是我目前正在做的事情:

# get a list of unique columns
final_names <- sapply(ms, colnames) %>% unlist %>% unique

# create an empty sparseMatrix of those dimensions
final_matrix <- matrix(0, nrow = nrow(ms$m1), ncol = length(final_names)) %>% 
  set_colnames(final_names) %>% as("sparseMatrix")

# add the matrices by column
for(mat in ms) {
  current_colnames <- colnames(mat)
  final_matrix[, current_colnames] <- mat + final_matrix[, current_colnames]
}

This is my output: 这是我的输出:

final_matrix
3 x 6 sparse Matrix of class "dgCMatrix"
       a    d   e   f   g   c
[1,]   5   11   8   6   7   8
[2,]  50  110  80  60  70  80
[3,] 500 1100 800 600 700 800

This works, but when I try it on the real dataset, I get a segmentation fault, so there must be a better way to create an empty sparse matrix or some other approach. 这可行,但是当我在真实数据集上尝试时,会遇到分割错误,因此必须有更好的方法来创建空的稀疏矩阵或其他方法。 Any ideas? 有任何想法吗?

NM = unique(unlist(lapply(ms, colnames)))
temp = do.call(cbind, ms)
sapply(NM, function(nm) rowSums(as.matrix(temp[,colnames(temp) %in% nm])))
#       a    d   e   f   g   c
#[1,]   5   11   8   6   7   8
#[2,]  50  110  80  60  70  80
#[3,] 500 1100 800 600 700 800

OR 要么

temp = do.call(cbind, lapply(ms, function(x) as.data.frame(as.matrix(x))))
sapply(split.default(temp, unlist(sapply(ms, colnames))), rowSums)
#       a   c    d   e   f   g
#[1,]   5   8   11   8   6   7
#[2,]  50  80  110  80  60  70
#[3,] 500 800 1100 800 600 700

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM