合并具有相同ID的行并删除重复的行

Question

After merging some data, I have multiple rows per ID. 合并一些数据后，每个ID有多行。 I ONLY want to keep multiple SAME ID's if the data differs. 如果数据不同，我只想保留多个SAME ID。 An NA value should be considered equal to any colwise data point. NA值应视为等于任何逐个数据点。

data: 数据：

df <- structure(list(id = c(1L, 2L, 2L, 2L, 3L, 3L, 4L, 4L, 4L, 5L), 
    v1 = structure(c(1L, 1L, NA, 1L, 1L, 1L, 1L, NA, 1L, 1L), .Label = "a", class = "factor"), 
    v2 = structure(c(1L, 2L, 2L, 3L, 1L, 1L, 1L, 1L, NA, 1L), .Label = c("a", 
    "b", "c"), class = "factor"), v3 = structure(c(1L, 1L, 1L, 
    1L, 1L, 1L, NA, 2L, 2L, 1L), .Label = c("a", "b"), class = "factor")), .Names = c("id", 
"v1", "v2", "v3"), row.names = c(NA, -10L), class = "data.frame")

looks like: 看起来像：

   id   v1   v2   v3
    1    a    a    a
    2    a    b    a
    2 <NA>    b    a
    2    a    c    a
    3    a    a    a
    3    a    a    a
    4    a    a <NA>
    4 <NA>    a    b
    4    a <NA>    b
    5    a    a    a

desired output: 所需的输出：

   id   v1   v2   v3
    1    a    a    a
    2    a    b    a
    2    a    c    a
    3    a    a    a
    4    a    a    b
    5    a    a    a

Happy if there exists a data.table solution. 如果存在一个data.table解决方案，那就很data.table 。

Answer 1

A possible solution using the data.table -package: 使用data.table可能解决方案：

library(data.table)
setDT(df)[, lapply(.SD, function(x) unique(na.omit(x))), by = id]

which gives: 这使：

  id v1 v2 v3 1: 1 aaa 2: 2 aba 3: 2 aca 4: 3 aaa 5: 4 aab 6: 5 aaa

Answer 2

First replace all NA with a respective column value , then find unique values 首先将所有NA替换为相应的列值，然后查找唯一值

library(data.table)
dt<-as.data.table(df)
for (j in seq_len(ncol(dt)))
     set(dt,which(is.na(dt[[j]])),j,dt[[j]][1]) #please feel to change dt[[j]][1] to na.omit(dt[[j]])[1] . It is a tradeoff between performance and perfection
unique(dt)
 id v1 v2 v3
1:  1  a  a  a
2:  2  a  b  a
3:  2  a  c  a
4:  3  a  a  a
5:  4  a  a  a
6:  4  a  a  b
7:  5  a  a  a

合并具有相同ID的行并删除重复的行

问题描述

data: 数据：

looks like: 看起来像：

desired output: 所需的输出：

2 个解决方案

解决方案1
4 已采纳 2017-11-08 15:14:26

解决方案2
1 2017-11-08 15:21:55

合并具有相同ID的行并删除重复的行

问题描述

data: 数据：

looks like: 看起来像：

desired output: 所需的输出：

2 个解决方案

解决方案1 4 已采纳 2017-11-08 15:14:26

解决方案2 1 2017-11-08 15:21:55

解决方案1
4 已采纳 2017-11-08 15:14:26

解决方案2
1 2017-11-08 15:21:55