I am using base R trying to split the following character string into 2:
some_text | number1 (number2) | number1 (number2) | number1 (number2)
vec1 <- number1 number1 number1
vec2 <- number2 number2 number2
I have been able to remove the some_text
and the |
symbol but need help creating the vectors based on pattern matching.
Here is a base R option using strsplit
, sub
, and apply
:
x <- "some_text | 1 (2) | 3 (4) | 5 (6)"
y <- strsplit(x, "\\s*\\|\\s*", perl=TRUE)
number1 <- sapply(y, function(x) { sub(" \\(\\d+\\)", "", x) })[2:4]
number2 <- sapply(y, function(x) { sub("\\d+ \\((\\d+)\\)", "\\1", x) })[2:4]
number1
number2
[1] "1" "3" "5"
[1] "2" "4" "6"
We can use base R
with strsplit
m1 <- lapply(strsplit(str1, "[ |()]"), function(x)
matrix(x[x!=""][-1], ncol=2, byrow= TRUE))[[1]]
vec1 <- m1[,1]
vec2 <- m1[,2]
vec1
#[1] "number1" "number1" "number1"
vec2
#[1] "number2" "number2" "number2"
str1 <- 'some_text | number1 (number2) | number1 (number2) | number1 (number2)'
The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.