How to use str_which to select rows containing strings from a vector

Question

I have a table like this:

library(stringr)  # provides str_detect() and str_which()

name    <- c("Goku","Vegeta","Jiren","Gohan","Piccolo","Kurinin","Trunks","Buu","Frieza","Cell","Muten","Gotens")
surname <- c("San","San","San","San","San","San","San","Majin","Evil","San","Roshi","San")
email   <- c("goku@gmail.com","vegeta@gmail.com","jiren@patrol.ch","gohan@gmail.com","piccolo@gmail.com","kurinin@gmail.com","Trunks@gmail.com","buu@babidi.com","frieza@rampage.usa","cell@rampage.usa","muten@gmail.com","gotens@gmail.com")

table <- data.frame(name, surname, email, stringsAsFactors = FALSE)

And I have a vector with different endings of email addresses. I want to find all rows whose email address has one of these endings:

searchvector = c("@patrol.ch", "@babidi.com", "@rampage.usa")
searchvector = as.character(searchvector)

I tried two ways to search for the rows containing the entries of searchvector:

A. Using str_detect:

table[str_detect(table$email, "@patrol.ch|@babidi.com|@rampage.usa"), ]

This gives me the correct result:

     name surname              email
3   Jiren     San    jiren@patrol.ch
8     Buu   Majin     buu@babidi.com
9  Frieza    Evil frieza@rampage.usa
10   Cell     San   cell@rampage.usa

B. But when using str_which, I always get only two rows:

table[str_which(table$email, searchvector), ]
table[str_which(table$email, c("@patrol.ch", "@babidi.com", "@rampage.usa")), ]

I get this result in both cases:

    name surname              email
8    Buu   Majin     buu@babidi.com
9 Frieza    Evil frieza@rampage.usa

Why is that? And how can I use str_which to accomplish what I want?

Answer 1

According to ?str_which, it is a wrapper function:

str_which() is a wrapper around which(str_detect(x, pattern)), and is equivalent to grep(pattern, x).

To get the same output, we need a single string as the pattern. It can be created with paste by setting the collapse argument to |:
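
The equivalence quoted above can be checked on a small vector. This is a minimal sketch: the vector x and the names idx_grep, idx_which and idx_str are made up for illustration, and the stringr comparison only runs if the package is installed.

```r
# Hypothetical sample data, not the OP's full table
x <- c("goku@gmail.com", "jiren@patrol.ch", "buu@babidi.com", "cell@rampage.usa")
pattern <- "@patrol.ch|@rampage.usa"

# Base R: grep() returns the positions of the matching elements
idx_grep <- grep(pattern, x)

# The same positions, built from a logical vector and which()
idx_which <- which(grepl(pattern, x))

# If stringr is available, str_which() should agree with both
if (requireNamespace("stringr", quietly = TRUE)) {
  idx_str <- stringr::str_which(x, pattern)
  stopifnot(identical(idx_str, idx_grep))
}

print(idx_grep)
print(idx_which)
```

Because all three return positions, any of them can be used directly as a row index into the data frame.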

table[str_which(table$email, paste(searchvector, collapse="|")), ]
#     name surname              email
#3   Jiren     San    jiren@patrol.ch
#8     Buu   Majin     buu@babidi.com
#9  Frieza    Evil frieza@rampage.usa
#10   Cell     San   cell@rampage.usa

just as the pattern was written out by hand for str_detect in the OP's post.

If we use the vector itself as the pattern in str_detect:
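
One caveat with the collapsed pattern: it is a regular expression, so each "." matches any character and the match may occur anywhere in the address, not only at the end. If the endings are meant literally, a base R alternative is endsWith(). The following is a sketch of that swapped-in technique, not what the answer above does; the sample addresses (including the made-up "spy@patrolxch.org") and the names loose and strict are assumptions for illustration.

```r
searchvector <- c("@patrol.ch", "@babidi.com", "@rampage.usa")

# Hypothetical addresses; the second one is not a real @patrol.ch address
email <- c("jiren@patrol.ch", "spy@patrolxch.org", "buu@babidi.com", "goku@gmail.com")

# Regex approach: "." matches any character, and the match can be anywhere
loose <- grep(paste(searchvector, collapse = "|"), email)

# Literal suffix approach: one logical vector per ending, combined with OR
hits <- Reduce(`|`, lapply(searchvector, function(s) endsWith(email, s)))
strict <- which(hits)

print(loose)   # 1 2 3
print(strict)  # 1 3
```

For the OP's data both approaches select the same rows; the difference only shows up with addresses like the second one here.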

table[str_detect(table$email, searchvector),]
#   name surname              email
#8    Buu   Majin     buu@babidi.com
#9 Frieza    Evil frieza@rampage.usa

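this returns the same output as str_which does with the OP's code.

As for vectorization: str_detect is vectorized over both the string and the pattern, but here 'email' (length 12) and 'searchvector' (length 3) have different lengths, so the pattern is recycled. Each email is then compared with only one pattern (the 1st email with the 1st pattern, the 2nd with the 2nd, the 3rd with the 3rd, the 4th with the 1st again, and so on) rather than with all three. That is why only rows 8 and 9 are returned: they happen to line up with their own endings, while rows 3 and 10 are paired with a different pattern.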
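
The recycling can be made explicit in base R. This is a sketch under assumptions: rep_len() and mapply() are used here to imitate the element-by-element pairing, and the names recycled, pairwise and any_match are illustrative. Base R is used because newer stringr releases may reject inputs of mismatched length instead of recycling them.

```r
email <- c("goku@gmail.com", "vegeta@gmail.com", "jiren@patrol.ch", "gohan@gmail.com",
           "piccolo@gmail.com", "kurinin@gmail.com", "Trunks@gmail.com", "buu@babidi.com",
           "frieza@rampage.usa", "cell@rampage.usa", "muten@gmail.com", "gotens@gmail.com")
searchvector <- c("@patrol.ch", "@babidi.com", "@rampage.usa")

# Stretch the 3 patterns to length 12: patrol, babidi, rampage, patrol, ...
recycled <- rep_len(searchvector, length(email))

# Compare email i with pattern i only (literal match, one pair at a time)
pairwise <- mapply(function(p, s) grepl(p, s, fixed = TRUE),
                   recycled, email, USE.NAMES = FALSE)
print(which(pairwise))   # 8 9

# Compare every email with all patterns at once via a single collapsed pattern
any_match <- grepl(paste(searchvector, collapse = "|"), email)
print(which(any_match))  # 3 8 9 10
```

Row 3 is paired with "@rampage.usa" and row 10 with "@patrol.ch", so neither matches in the pairwise comparison even though both have one of the wanted endings.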

Answer 1
1 2019-01-18 11:45:35