How to remove unwanted characters in R?

Question

I have a dataset like the one below. How can I remove the '#number' parts?

df>
terms                             year
5;#Remote Production;#10;         2021
53;#=Product-Category:Routing     2021
30;#HDR;#5;#Remote Production     2020
...

I need it to look like this:

df>
terms                          year
#Remote Production             2021
#Product-Category:Routing      2021
#HDR;#Remote Production        2020
...

The number at the beginning, which has no # in front of it, also needs to be removed.

Answer 1

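One option is str_remove_all from stringr. The pattern removes the leading number together with its ";#" (and an optional "="), as well as any "#number;" token; str_c then puts the leading "#" back.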

library(stringr)
library(dplyr)
df %>%
   mutate(terms = str_c('#', str_remove_all(terms, "^\\d+;#\\=?|#\\d+;")))

-output

#                     terms year
#1       #Remote Production; 2021
#2 #Product-Category:Routing 2021
#3   #HDR;#Remote Production 2020
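
Note that the first row above still ends in ";", whereas the desired output in the question does not. The following is a minimal sketch of one way to drop it as well. It assumes a trailing semicolon in terms is never meaningful; the extra str_remove step and the name df_clean are additions for illustration, not part of the original answer.

```r
library(stringr)
library(dplyr)

# Same sample data as in the answer
df <- data.frame(
  terms = c("5;#Remote Production;#10;",
            "53;#=Product-Category:Routing",
            "30;#HDR;#5;#Remote Production"),
  year = c(2021L, 2021L, 2020L)
)

df_clean <- df %>%
  mutate(
    # drop the leading "<digits>;#" (with optional "=") and any "#<digits>;" token
    terms = str_remove_all(terms, "^\\d+;#=?|#\\d+;"),
    # assumption: a trailing ";" is leftover debris, so strip it
    terms = str_remove(terms, ";$"),
    # restore the leading "#"
    terms = str_c("#", terms)
  )

df_clean
```

If a trailing semicolon can carry meaning in the real data, skip the str_remove step and keep the original one-liner.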

data

df <- structure(list(terms = c("5;#Remote Production;#10;", "53;#=Product-Category:Routing", 
"30;#HDR;#5;#Remote Production"), year = c(2021L, 2021L, 2020L
)), class = "data.frame", row.names = c(NA, -3L))

Answer 1 (4, accepted) 2021-02-10 18:53:24
