I have a matrix that contains the string "Energy per �m". Before the 'm' is a diamond shaped symbol with a question mark in it - I don't know what it is.
I have tried to get rid of it by using this on the column of the matrix:
a=gsub('Energy per �m','',a)
[and using copy/paste for the first term of gsub], but it does not work.[unexpected symbol in "a=rep(5,Energy per"]. When I try to extract something from the original matrix with grepl I get:
46: In grepl("ref. value", raw$parameter) :
input string 15318 is invalid in this locale
How can I get rid of all this sort of signs? I would like to have only 0-9, AZ, az, / and '. The rest can be zapped.
There is probably a better way to do this than with regex (eg by changing the Encoding
).
But here is your regex solution:
gsub("[^0-9A-Za-z///' ]", "", a)
[1] "Energy per m"
But, as pointed out by @JoshuaUlrich, you're better off to use:
gsub("[^[:alnum:]///' ]", "", x)
[1] "Energy per m"
str_replace_all()
is an option if you prefer to use the stringr
package:
library(stringr)
x <- 'Energy per �m'
str_replace_all(x, "[^[:alnum:]///' ]", "")
[1] "Energy per m"
The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.