[英]how to remove special characters and number patterns from a string in R
I have a string "<U+7F85><U+934F><U+6DC7> <U+2730> Sascha Banks"
我有一个字符串"<U+7F85><U+934F><U+6DC7> <U+2730> Sascha Banks"
I want to exclude everything except the name "Sacha Banks"
. 我想排除名称"Sacha Banks"
以外的所有内容。
I perform: 我执行:
name1<-c("<U+7F85><U+934F><U+6DC7> <U+2730> Sascha Banks ")
name2<-str_replace_all(name1, "[^[:alnum:]]", " ")
Actual Output: " U 7F85 U 934F U 6DC7 U 2730 Sascha Banks "
实际输出: " U 7F85 U 934F U 6DC7 U 2730 Sascha Banks "
Expected Output: " Sascha Banks "
预期产出: " Sascha Banks "
Please correct me. 请纠正我。
Try 尝试
x <- "<U+7F85><U+934F><U+6DC7> <U+2730> Sascha Banks"
gsub("(<.*>)", "", x)
## [1] " Sascha Banks"
Try 尝试
gsub("<[^>]*>", "", name1)
## [1] " Sascha Banks "
If you don't care to learn the regex this is a pretty straight forward approach that removes all angle brackets: 如果您不愿意学习正则表达式,这是一种非常简单的方法,它删除了所有尖括号:
library(qdap)
bracketX("<U+7F85><U+934F><U+6DC7> <U+2730> Sascha Banks", "angle")
## [1] "Sascha Banks"
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.