R：将数据帧列表合并为单个数据帧，使用列表索引添加列

Question

The question is very similar to this one . 问题与这一问题非常相似。 It is for combining a list of data frames into a single longer data frame. 它用于将数据帧列表组合成单个较长的数据帧。 However, I want to keep the information from which item of the list the data came from by adding an extra column with the index (id or source) of the list. 但是，我希望通过添加包含列表索引（id或source）的额外列来保留数据来自列表项的信息。

This is the data (borrowing code from the linked example): 这是数据（来自链接示例的借用代码）：

dfList <- NULL
set.seed(1)
for (i in 1:3) {
     dfList[[i]] <- data.frame(a=sample(letters, 5, rep=T), b=rnorm(5), c=rnorm(5))
}

Using the code below provides a concatenated data frame, but does not add the column for the list index.: 使用下面的代码提供了连接数据框，但不添加列表索引的列：

df <- do.call("rbind", dfList)

How do I concatenate the data frames in the list while creating a column to capture the origin within the list? 如何在创建列以捕获列表中的原点时连接列表中的数据框？ Something like the following: 类似于以下内容：

在此输入图像描述

Thank you very much in advance. 非常感谢你提前。

Answer 1

Try data.table::rbindlist 尝试data.table::rbindlist

library(data.table) # v1.9.5+
rbindlist(dfList, idcol = "index")
#     index a           b            c
#  1:     1 g  1.27242932 -0.005767173
#  2:     1 j  0.41464143  2.404653389
#  3:     1 o -1.53995004  0.763593461
#  4:     1 x -0.92856703 -0.799009249
#  5:     1 f -0.29472045 -1.147657009
#  6:     2 k -0.04493361  0.918977372
#  7:     2 a -0.01619026  0.782136301
#  8:     2 j  0.94383621  0.074564983
#  9:     2 w  0.82122120 -1.989351696
# 10:     2 i  0.59390132  0.619825748
# 11:     3 m -1.28459935 -0.649471647
# 12:     3 w  0.04672617  0.726750747
# 13:     3 l -0.23570656  1.151911754
# 14:     3 g -0.54288826  0.992160365
# 15:     3 b -0.43331032 -0.429513109

Answer 2

You can do this in base: 你可以在基地做到这一点：

df[["index"]] <- rep(seq_along(dfList), sapply(dfList, nrow))
df

##    a           b            c index
## 1  g  1.27242932 -0.005767173     1
## 2  j  0.41464143  2.404653389     1
## 3  o -1.53995004  0.763593461     1
## 4  x -0.92856703 -0.799009249     1
## 5  f -0.29472045 -1.147657009     1
## 6  k -0.04493361  0.918977372     2
## 7  a -0.01619026  0.782136301     2
## 8  j  0.94383621  0.074564983     2
## 9  w  0.82122120 -1.989351696     2
## 10 i  0.59390132  0.619825748     2
## 11 m -1.28459935 -0.649471647     3
## 12 w  0.04672617  0.726750747     3
## 13 l -0.23570656  1.151911754     3
## 14 g -0.54288826  0.992160365     3
## 15 b -0.43331032 -0.429513109     3

You can also do: 你也可以这样做：

library(qdapTools)
list_df2df(setNames(dfList, 1:3), "index")

##    index a           b            c
## 1      1 g  1.27242932 -0.005767173
## 2      1 j  0.41464143  2.404653389
## 3      1 o -1.53995004  0.763593461
## 4      1 x -0.92856703 -0.799009249
## 5      1 f -0.29472045 -1.147657009
## 6      2 k -0.04493361  0.918977372
## 7      2 a -0.01619026  0.782136301
## 8      2 j  0.94383621  0.074564983
## 9      2 w  0.82122120 -1.989351696
## 10     2 i  0.59390132  0.619825748
## 11     3 m -1.28459935 -0.649471647
## 12     3 w  0.04672617  0.726750747
## 13     3 l -0.23570656  1.151911754
## 14     3 g -0.54288826  0.992160365
## 15     3 b -0.43331032 -0.429513109

Answer 3

This is a dplyr solution that does exactly what you are looking for: 这是一个完全符合您要求的dplyr解决方案：

dfList <- NULL
set.seed(1)
for (i in 1:3) {
  dfList[[i]] <- data.frame(a=sample(letters, 5, rep=T), b=rnorm(5), c=rnorm(5))
}

df <- dplyr::bind_rows(dfList, .id = "index")

R：将数据帧列表合并为单个数据帧，使用列表索引添加列

问题描述

3 个解决方案

解决方案1
8 已采纳 2015-05-10 11:47:53

解决方案2
3 2015-05-10 11:48:43

解决方案3
1 2017-12-20 02:40:39

R：将数据帧列表合并为单个数据帧，使用列表索引添加列

问题描述

3 个解决方案

解决方案1 8 已采纳 2015-05-10 11:47:53

解决方案2 3 2015-05-10 11:48:43

解决方案3 1 2017-12-20 02:40:39

解决方案1
8 已采纳 2015-05-10 11:47:53

解决方案2
3 2015-05-10 11:48:43

解决方案3
1 2017-12-20 02:40:39