Split a string on a space followed by an uppercase letter

Question

I'm trying to split my strings into multiple rows. The input looks like this:

x <- c("C 10.1 C 12.4","C 12", "C 45.5 C 10")

Code snippet:

strsplit(x, "//s")[[3]]

Result:

"C 45.5 C 10"

Expected output: the strings split into multiple rows, like this:

"C 10.1"
"C 12.4"
"C 12"
"C 45.5"
"C 10"

The question is how to split the strings.

Clue: each split point is a space followed by a letter, which is "C" in our case. Does anyone know how to do it?

Answer 1

You may use

unlist(strsplit(x, "(?<=\\d)\\s+(?=C)", perl=TRUE))

Output:

[1] "C 10.1" "C 12.4" "C 12"   "C 45.5" "C 10"

See the online R demo and a regex demo.

The (?<=\\d)\\s+(?=C) regex matches 1 or more whitespace characters (\\s+) that are immediately preceded by a digit ((?<=\\d)) and immediately followed by C. Only the whitespace is consumed, so the digit and the C stay in the resulting pieces.
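
To see what the pattern actually matches, the following minimal sketch locates the matches with base R's gregexpr and regmatches before splitting. It reuses x from the question; the variable names pattern, m, matched and pieces are only illustrative.

```r
x <- c("C 10.1 C 12.4", "C 12", "C 45.5 C 10")
pattern <- "(?<=\\d)\\s+(?=C)"

# Lookarounds are a PCRE feature, so perl = TRUE is needed here as well.
m <- gregexpr(pattern, x, perl = TRUE)

# Only the whitespace is part of each match; the second string has no match.
matched <- regmatches(x, m)
print(matched)

# The space inside "C 10.1" follows "C", not a digit, so it is not a split point.
pieces <- unlist(strsplit(x, pattern, perl = TRUE))
print(pieces)
```

The second element of matched is empty because "C 12" contains no whitespace that sits between a digit and a C.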

If C can be any uppercase ASCII letter, replace C with [A-Z].
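
As a small sketch of that variant, the input y below is hypothetical (it is not from the question) and simply mixes several uppercase letters in the same layout:

```r
# Hypothetical input: same shape as the question's data, but with mixed letters.
y <- c("A 10.1 B 12.4", "C 12", "D 45.5 E 10")

# [A-Z] in the lookahead accepts any uppercase ASCII letter as the start of a piece.
parts <- unlist(strsplit(y, "(?<=\\d)\\s+(?=[A-Z])", perl = TRUE))
print(parts)
```

Each piece keeps its own leading letter because the lookahead does not consume it.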

Answer 2

A somewhat more complicated expression, but easier on the regex side:

unlist(
  sapply(
    strsplit(x, " ?C"),
    function(x) {
      paste0("C", x[nzchar(x)])
    }
  )
)

Output:

[1] "C 10.1" "C 12.4" "C 12"   "C 45.5" "C 10"

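The intermediate steps of Answer 2 may be easier to follow when broken out. This is only a sketch of the same idea in base R, using lapply in place of sapply; the names raw, rebuilt and result are illustrative and not part of the original answer.

```r
x <- c("C 10.1 C 12.4", "C 12", "C 45.5 C 10")

# Splitting on an optional space plus "C" consumes the "C" itself and
# leaves an empty first piece because each input starts with "C".
raw <- strsplit(x, " ?C")
print(raw[[1]])

# nzchar() drops the empty pieces; paste0() puts the consumed "C" back.
rebuilt <- lapply(raw, function(p) paste0("C", p[nzchar(p)]))
result <- unlist(rebuilt)
print(result)
```

Because the "C" is hard-coded in both the split pattern and paste0, this approach only works when every piece starts with the same letter.
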
Answer 1: score 2, accepted, 2018-07-05 12:09:15

Answer 2: score 1, 2018-07-05 14:15:59