根据一列选择数据集的子集

Question

I have a dataset with 2 columns我有一个包含 2 列的数据集

                           text  created
    1                   cant do it with cards either 1/2/2014
    2                   cant do it with cards either 2/2/2014
    3                            Coming back home AK 2/2/2014
    4                            Coming back home AK 5/2/2014
    5                                 gotta try PNNL 1/2/2014
    6 Me and my Tart would love to flyLoveisintheAir 5/2/2014
    7 Me and my Tart would love to flyLoveisintheAir 6/2/2014

How can I get subset the dataset, based on the unique string of first column?如何根据第一列的唯一字符串获取数据集的子集？

                           text  created
    1                   cant do it with cards either 1/2/2014
    3                            Coming back home AK 2/2/2014
    5                                 gotta try PNNL 1/2/2014
    6 Me and my Tart would love to flyLoveisintheAir 5/2/2014


structure(list(text = structure(c(1L, 1L, 2L, 2L, 3L, 4L, 4L), .Label = c("cant do it with cards either", 
"Coming back home AK", "gotta try PNNL", "Me and my Tart would love to flyLoveisintheAir"
), class = "factor"), created = structure(c(1L, 2L, 2L, 3L, 1L, 
3L, 4L), .Label = c("1/2/2014", "2/2/2014", "5/2/2014", "6/2/2014"
), class = "factor")), .Names = c("text", "created"), class = "data.frame", row.names =  c(NA, 
-7L))

Answer 1

Try using duplicated and !尝试使用duplicated和! . . Consider df is your data.frame.考虑df是你的 data.frame。

> df[!duplicated(df$text), ]
                                            text  created
1                   cant do it with cards either 1/2/2014
3                            Coming back home AK 2/2/2014
5                                 gotta try PNNL 1/2/2014
6 Me and my Tart would love to flyLoveisintheAir 5/2/2014

Answer 2

there is a lot of possibilities:有很多可能性：

tab[!duplicated(tab$text),]
# with dplyr
filter(tab, !duplicated(text))

hth第

根据一列选择数据集的子集

问题描述

2 个解决方案

解决方案1
1 2014-04-16 13:04:02

解决方案2
0 2014-04-16 13:03:42

根据一列选择数据集的子集

问题描述

2 个解决方案

解决方案1 1 2014-04-16 13:04:02

解决方案2 0 2014-04-16 13:03:42

解决方案1
1 2014-04-16 13:04:02

解决方案2
0 2014-04-16 13:03:42