如何将矩阵的列分为两列？

Question

I have matrix, which one of the columns of it has IDs which sperated with ",". 我有矩阵，矩阵的其中一列具有以“，”分隔的ID。 I just want to split that cloumn into two columns st each new column just have one part of IDs. 我只想将该Cloumn分成两列，而每个新列只有一部分ID。 what is the easiest way to do it ? 最简单的方法是什么？

My matrix is: 我的矩阵是：

> L
     a    u                    
[1,] "10" "mature,MIMAT0000062"
[2,] "20" "stemloop"           
[3,] "40" "mature,MIMAT0000062"

and the expected output is : 预期的输出是：

> k
     a    u          v             
[1,] "10" "mature"   "MIMAT0000062"
[2,] "20" "stemloop" "NA"          
[3,] "40" "mature"   "MIMAT0000062"
>

Edit: 编辑：

Now I have to split this matrix into two matrix based on the column with "NA" values, one with all "NA" and other without "NA". 现在，我必须根据具有“ NA”值的列将该矩阵分为两个矩阵，一个矩阵全为“ NA”，另一个矩阵全为“ NA”。

Input: 输入：

>k
       a    u         v
[1,] "10" "mature"    "MIMAT0000062"
[2,] "20" "stemloop"  "NA"
[3,] "40" "mature_2"  "MIMAT0000043"

Output should be like, 输出应该像

>k1
       a    u         v
[1,] "10" "mature"    "MIMAT0000062"
[2,] "40" "mature_2"  "MIMAT0000043"

>k2
       a    u         v
[1,] "20" "stemloop"  "NA"

Answer 1

I have a function called cSplit that is quite fast and deals with these types of problems very easily. 我有一个名为cSplit的函数，该函数非常快，可以非常轻松地处理这些类型的问题。

Here are a few examples of the function in use, along with some different cases to consider: 以下是使用中的函数的一些示例，以及要考虑的一些不同情况：

Your existing sample data: 您现有的样本数据：

M1 <- cbind(a = c(10,20,40), 
            u = c("mature,MIMAT0000062", 
                  "stemloop", "mature,MIMAT0000062"))
cSplit(data.frame(M1), "u", ",")
#     a      u_1          u_2
# 1: 10   mature MIMAT0000062
# 2: 20 stemloop           NA
# 3: 40   mature MIMAT0000062

One "u" value with a comma at the start: 一个“ u”值，以逗号开头：

M2 <- cbind(a = c(10,20,40), 
            u = c(",MIMAT0000062", 
                  "stemloop", "mature,MIMAT0000062"))
cSplit(data.frame(M2), "u", ",")
#     a      u_1          u_2
# 1: 10          MIMAT0000062
# 2: 20 stemloop           NA
# 3: 40   mature MIMAT0000062

One "u" value that splits into 3 columns: 一个“ u”值，分为3列：

M3 <- cbind(a = c(10,20,40), 
            u = c("mature,MIMAT0000062", 
                  "stemloop,,something", "mature,MIMAT0000062"))
cSplit(data.frame(M3), "u", ",")
#     a      u_1          u_2       u_3
# 1: 10   mature MIMAT0000062        NA
# 2: 20 stemloop              something
# 3: 40   mature MIMAT0000062        NA

Answer 2

This works when the values are comma separated: 这些值用逗号分隔时有效：

sep_cols = matrix(unlist(strsplit(as.character(L$u), ",")), ncol = 2)
new_L = cbind(L, sep_cols)

Answer 3

A different way.. 一种不同的方式

a <- c(10,20,40)
u <- c("mature,MIMAT0000062", "stemloop", "mature,MIMAT0000062")
L <- data.frame(a,u)     #better use a data.frame


v <- strsplit(as.character(L$u), ",")
L$u <- sapply(v, `[`, 1)
L$v <- sapply(v, `[`, 2)

> L
#   a        u            v
#1 10   mature MIMAT0000062
#2 20 stemloop         <NA>
#3 40   mature MIMAT0000062

Answer 4

A two liner: 两个内胆：

L$v =sapply(strsplit(as.character(L$u),","), "[", 2)
L$u =sapply(strsplit(as.character(L$u),","), "[", 1)
#L
#   a        u            v
#1 10   mature MIMAT0000062
#2 20 stemloop         <NA>
#3 40   mature MIMAT0000062

Answer 5

Another alternative using reshape2::colsplit as joran suggested: 另一个使用reshape2::colsplit替代方法，如joran建议的那样：

library(reshape2)
k = cbind(a =L$a,colsplit(L$u,",",c("u","v")))
#k
#   a   u       v
#1  10  mature  MIMAT0000062
#2  20  stemloop     
#3  40  mature  MIMAT0000062

如何将矩阵的列分为两列？

问题描述

5 个解决方案

解决方案1
2 2014-06-27 16:40:39

解决方案2
1 2014-05-21 14:46:38

解决方案3
1 已采纳 2014-05-21 14:49:26

解决方案4
1 2014-05-21 14:51:03

解决方案5
1 2014-05-21 15:05:07

如何将矩阵的列分为两列？

问题描述

5 个解决方案

解决方案1 2 2014-06-27 16:40:39

解决方案2 1 2014-05-21 14:46:38

解决方案3 1 已采纳 2014-05-21 14:49:26

解决方案4 1 2014-05-21 14:51:03

解决方案5 1 2014-05-21 15:05:07

解决方案1
2 2014-06-27 16:40:39

解决方案2
1 2014-05-21 14:46:38

解决方案3
1 已采纳 2014-05-21 14:49:26

解决方案4
1 2014-05-21 14:51:03

解决方案5
1 2014-05-21 15:05:07