Removing duplicates while keeping the rows without NA values in R

Question

My data set (df) looks like this:

   ID    Name    Rating    Score  Ranking
   1     abc       3        NA      NA
   1     abc       3        12      13
   2     bcd       4        NA      NA
   2     bcd       4        19      20

I'm trying to remove duplicates using:

   df <- df[!duplicated(df[1:2]),]

which gives:

   ID    Name    Rating    Score  Ranking
   1     abc       3        NA      NA
   2     bcd       4        NA      NA

but I'm trying to get:

   ID    Name    Rating    Score  Ranking
   1     abc       3        12      13
   2     bcd       4        19      20

How do I avoid keeping the rows that contain NAs when removing duplicates? Some help would be great, thanks.

Answer 1

First, sort the data so that the NAs come last within each ID/Name group. Note that na.last = TRUE must be passed to order(), not to with():

df <- df[with(df, order(ID, Name, Score, Ranking, na.last = TRUE)), ]

Then remove the duplicates with fromLast = FALSE, which keeps the first row of each group:

df <- df[!duplicated(df[1:2], fromLast = FALSE), ]
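
The two steps of this answer can be put together as a minimal base R sketch. The data frame is rebuilt from the table in the question, and the names sorted and result are only illustrative:

```r
# Sample data reconstructed from the question
df <- data.frame(
  ID      = c(1, 1, 2, 2),
  Name    = c("abc", "abc", "bcd", "bcd"),
  Rating  = c(3, 3, 4, 4),
  Score   = c(NA, 12, NA, 19),
  Ranking = c(NA, 13, NA, 20)
)

# Sort so that, within each ID/Name pair, rows with NA come last
sorted <- df[order(df$ID, df$Name, df$Score, df$Ranking, na.last = TRUE), ]

# Keep the first row of each ID/Name pair, which is now the one with values
result <- sorted[!duplicated(sorted[c("ID", "Name")]), ]
print(result)
```

Selecting the key columns by name instead of df[1:2] keeps the code working if the column order changes.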

Answer 2

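Using dplyr (this relies on the row with NAs appearing before the complete row in each group, as in the sample data):

library(dplyr)
df <- df %>% filter(!duplicated(.[, 1:2], fromLast = TRUE))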
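
If the row order cannot be relied on, one possible variant sorts first and then keeps the first row per key. This is a sketch rather than the answer's own method: it assumes dplyr is installed and uses arrange() and distinct() with .keep_all = TRUE, and the name result is illustrative.

```r
library(dplyr)

# Sample data reconstructed from the question
df <- data.frame(
  ID      = c(1, 1, 2, 2),
  Name    = c("abc", "abc", "bcd", "bcd"),
  Rating  = c(3, 3, 4, 4),
  Score   = c(NA, 12, NA, 19),
  Ranking = c(NA, 13, NA, 20)
)

result <- df %>%
  # is.na(Score) is FALSE for complete rows, so they sort first in each group
  arrange(ID, Name, is.na(Score)) %>%
  # keep the first row for each ID/Name pair, retaining all other columns
  distinct(ID, Name, .keep_all = TRUE)

print(result)
```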

Answer 3

You could select the rows you want with which() and then apply unique(). Test for missing values with is.na() rather than comparing against the string "NA", which only works by accident:

a <- which(!is.na(df[, 'Score']) | !is.na(df[, 'Ranking']))

df2 <- unique(df[a, ])

> df2
  ID Name Rating Score Ranking
2  1  abc      3    12      13
4  2  bcd      4    19      20
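
A related base R sketch uses complete.cases() to drop the incomplete rows before deduplicating. It differs from the code above in one respect: it keeps only rows where both Score and Ranking are present, not rows where either one is. The names complete_rows and result are illustrative.

```r
# Sample data reconstructed from the question
df <- data.frame(
  ID      = c(1, 1, 2, 2),
  Name    = c("abc", "abc", "bcd", "bcd"),
  Rating  = c(3, 3, 4, 4),
  Score   = c(NA, 12, NA, 19),
  Ranking = c(NA, 13, NA, 20)
)

# Keep only rows where both Score and Ranking are non-missing
complete_rows <- df[complete.cases(df[c("Score", "Ranking")]), ]

# Then drop any remaining duplicates on the ID/Name key
result <- complete_rows[!duplicated(complete_rows[c("ID", "Name")]), ]
print(result)
```

Note that an ID/Name pair whose rows all contain NA disappears entirely with this approach, whereas the sort-then-deduplicate approach would still keep one row for it.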
