Filter to remove all rows before the first occurrence of a particular value in a specific column

Question

I would like to filter a data frame to remove all rows before the first time a particular value appears in a specific column. For example, in the data frame below, I would like to remove all rows before bob appears in column a for the first time. Please note that bob appears a second time; I only want to remove the rows before its first appearance.

(dat<-data.frame(a= c("pete", "mike", "bob", "bart", "bob"), b=c(1,2,3,4,5), c=c("home", "away", "home", "away", "gone")))
     a b    c
1 pete 1 home
2 mike 2 away
3  bob 3 home
4 bart 4 away
5  bob 5 gone

I want the resulting data frame to look like the following:

     a b    c
1  bob 3 home
2 bart 4 away
3  bob 5 gone

Answer 1

A dplyr way, using slice:

library(dplyr)
dat %>% slice(which.max(a == "bob") : n())

#     a b    c
#1  bob 3 home
#2 bart 4 away
#3  bob 5 gone

which in base R would be:

dat[which.max(dat$a == "bob") : nrow(dat), ]
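One caveat with the which.max approach: if the value never appears, the comparison is all FALSE, which.max returns 1, and every row is kept. Whether that is the desired behaviour depends on the use case. Below is a minimal sketch of a guard using match(), which returns NA when the value is absent. The helper name drop_before_first and the choice to return zero rows in that case are assumptions for illustration, not part of the original answer.

```r
dat <- data.frame(a = c("pete", "mike", "bob", "bart", "bob"),
                  b = c(1, 2, 3, 4, 5),
                  c = c("home", "away", "home", "away", "gone"))

# Hypothetical helper: keep rows from the first occurrence of `value`
# in column `col` onwards; return zero rows if `value` never appears.
drop_before_first <- function(df, col, value) {
  first <- match(value, df[[col]])  # index of first match, NA if absent
  if (is.na(first)) {
    return(df[0, , drop = FALSE])   # assumption: no match means no rows
  }
  df[first:nrow(df), , drop = FALSE]
}

res  <- drop_before_first(dat, "a", "bob")  # rows 3 to 5 of dat
none <- drop_before_first(dat, "a", "zed")  # value absent, zero rows
```

If keeping all rows is preferable when the value is missing, return df instead of df[0, , drop = FALSE] in the NA branch.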

Answer 2

cumsum is usually a good candidate for such tasks:

dat[cumsum(dat$a == "bob") >= 1, ]
#     a b    c
#3  bob 3 home
#4 bart 4 away
#5  bob 5 gone
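To see why this works, it can help to look at the intermediate vectors. This is only an illustrative breakdown of the one-liner above; the variable names hits, running and keep are made up for the example.

```r
a <- c("pete", "mike", "bob", "bart", "bob")

hits    <- a == "bob"     # FALSE FALSE TRUE FALSE TRUE
running <- cumsum(hits)   # 0 0 1 1 2: count of matches seen so far
keep    <- running >= 1   # FALSE FALSE TRUE TRUE TRUE

a[keep]                   # "bob" "bart" "bob"
```

The running count is 0 until the first match and never drops back, so the logical mask stays TRUE from that row onwards. Base R subsetting keeps the original row names (3, 4, 5 above); rownames(result) <- NULL resets them if needed.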

Answer 3

We can use cummax:

library(dplyr)
dat %>%
     filter(cummax(a == "bob") > 0)
#     a b    c
#1  bob 3 home
#2 bart 4 away
#3  bob 5 gone


3 answers

Answer 1
8 (accepted) 2019-04-11 07:24:45

Answer 2
4 2019-04-11 07:17:47

Answer 3
2 2019-04-11 12:45:57
