R, dplyr: How to use gsub to transform values in one column based on values in another column

Question

I have a data frame with two (relevant) factors, and I'd like to remove from the value of one factor any substring equal to the value of the other factor, leaving it unchanged if there is no such substring. Can I do this using dplyr?

To make an MWE, suppose these factors are x and y.

library(dplyr)
df <- data.frame(x = c(rep('abc', 3)), y = c('a', 'b', 'd'))

df:

      x y
1   abc a
2   abc b
3   abc d

What I want:

      x y
1    bc a
2    ac b
3   abc d

My attempt was:

df |> transform(x = gsub(y, '', x))

However, this produces the following incorrect result, plus a warning message:

    x y
1  bc a
2  bc b
3  bc d

 Warning message:
 In gsub(y, "", x) :
    argument 'pattern' has length > 1 and only the first element will be used

How can I do this?

Answer 1

str_remove is vectorized over the pattern, unlike gsub:

library(stringr)
library(dplyr)
df <- df %>% 
    mutate(x = str_remove(x, y))

-output

df
    x y
1  bc a
2  ac b
3 abc d
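
One caveat with this approach: str_remove treats the pattern as a regular expression and removes only the first match. The following is a minimal sketch, assuming that y may contain regex metacharacters such as "." and that every occurrence should be removed (as gsub would do). The data frame df2 and the column x_clean are made up for illustration and are not part of the question.

```r
library(stringr)
library(dplyr)

# Hypothetical data: y holds a regex metacharacter (".") and a repeated match ("a")
df2 <- data.frame(
  x = c("a.b.c", "abab", "abc"),
  y = c(".", "a", "d"),
  stringsAsFactors = FALSE
)

res <- df2 %>%
  mutate(
    # fixed() matches each y literally instead of as a regex;
    # str_remove_all() removes every occurrence, not just the first
    x_clean = str_remove_all(x, fixed(y))
  )

res$x_clean
# "abc" "bb" "abc"
```

Without fixed(), the pattern "." would match any character rather than a literal dot, so whether to wrap the pattern depends on what the second column actually holds.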

If we want to use sub/gsub, we may need rowwise:

df %>%
   rowwise %>%
   mutate(x = sub(y, "", x)) %>%
   ungroup
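
If adding a package is not an option, a base R version of the same row-by-row idea might look like the sketch below. It assumes the columns are character vectors (hence stringsAsFactors = FALSE) and that a literal match is wanted (fixed = TRUE); drop that argument to get regex matching back.

```r
df <- data.frame(
  x = rep("abc", 3),
  y = c("a", "b", "d"),
  stringsAsFactors = FALSE
)

# mapply() calls gsub() once per row, pairing each pattern with its own string,
# which sidesteps gsub() using only the first element of a longer pattern vector
df$x <- mapply(
  function(pattern, string) gsub(pattern, "", string, fixed = TRUE),
  df$y, df$x,
  USE.NAMES = FALSE  # otherwise the result would be named after the patterns
)

df$x
# "bc" "ac" "abc"
```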

Solution 1 (2021-11-21 18:38:42)

r、dplyr：如何使用 gsub 根据另一列中的值转换一列中的值

问题描述

1 个解决方案

解决方案1 1 2021-11-21 18:38:42

解决方案1
1 2021-11-21 18:38:42