[英]Skip rows which has the same name in one column
因此,我想从我的数据中删除具有相同“ id”的行(在本例中为221和341)。如您所见,其他列中的值不同,因此我想删除重复的行(相同的id),但是我仍然想保留其中一行。
>head(data)
id Sequence Score variable value
221 AAIYKLLKSHF 30.87 BiotinControl1_2 221498.368
341 EKPLFLVFHGT 42.04 BiotinControl1_2 255198.8712
330 FMIAPTGAKTF 49.67 BiotinControl1_2 1526577.651
221 FMIAPTGAKTF 40.69 BiotinControl2 429384.4861
341 GKLKGVLGYTE 39.34 BiotinTreatment1 597884.8329
196 IKKVVKAAAE 39.81 BiotinControl1_2 2277435.528
117 IKKVVKAAAE 44.86 BiotinControl1_2 1050394.325
所以我的数据应该是这样的:
id Sequence Score variable value
221 AAIYKLLKSHF 30.87 BiotinControl1_2 221498.368
341 EKPLFLVFHGT 42.04 BiotinControl1_2 255198.8712
330 FMIAPTGAKTF 49.67 BiotinControl1_2 1526577.651
196 IKKVVKAAAE 39.81 BiotinControl1_2 2277435.528
117 IKKVVKAAAE 44.86 BiotinControl1_2 1050394.325
我应该使用哪个函数来执行类似的操作?
使用duplicated
dat[!duplicated(dat$id),]
id Sequence Score variable value
1 221 AAIYKLLKSHF 30.87 BiotinControl1_2 221498.4
2 341 EKPLFLVFHGT 42.04 BiotinControl1_2 255198.9
3 330 FMIAPTGAKTF 49.67 BiotinControl1_2 1526577.7
6 196 IKKVVKAAAE 39.81 BiotinControl1_2 2277435.5
7 117 IKKVVKAAAE 44.86 BiotinControl1_2 1050394.3
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.