从字符串中提取特定值

Question

Suppose I have a list of string:假设我有一个字符串列表：

distance <- c("CHI #12 DEBRINCAT(1), Snap, Off. Zone, 18 ft.Assists: #88 KANE(2); #56 GUSTAFSSON(1)", "TOR ONGOAL - #44 RIELLY, Backhand, Off. Zone, 77 ft.")

Now I hope to get a string vector that contains only the parts that contains the distance, that is, substring = c ("18 ft", "77 ft").现在我希望得到一个只包含包含距离的部分的字符串向量，即substring = c("18 ft", "77 ft")。

Is there a convenient way in R to do this? R 中是否有方便的方法来执行此操作？

Answer 1

Using str_extract to match one or more digits followed by zero or more spaces ( \\s* ) and the substring 'ft'使用str_extract匹配一个或多个数字后跟零个或多个空格 ( \\s* ) 和 substring 'ft'

library(stringr)
str_extract(distance, "\\d+\\s*ft")
#[1] "18 ft" "77 ft"

Answer 2

Alternatives:备择方案：

regmatches(distance, gregexpr("\\b[0-9]+\\s*ft", distance, perl = TRUE))
# [[1]]
# [1] "18 ft"
# [[2]]
# [1] "77 ft"

strcapture("\\b([0-9]+\\s*ft)", distance, list(dist = ""))
#    dist
# 1 18 ft
# 2 77 ft

Though they're all just doing the same thing with slightly different interfaces.尽管他们都只是在做同样的事情，但界面略有不同。

Answer 3

Try gsub试试gsub

> gsub(".*?(\\d+\\s+ft).*", "\\1", distance)
[1] "18 ft" "77 ft"

从字符串中提取特定值

问题描述

3 个解决方案

解决方案1
3 已采纳 2021-03-30 01:28:11

解决方案2
3 2021-03-30 01:29:53

解决方案3
1 2021-03-30 08:10:10

从字符串中提取特定值

问题描述

3 个解决方案

解决方案1 3 已采纳 2021-03-30 01:28:11

解决方案2 3 2021-03-30 01:29:53

解决方案3 1 2021-03-30 08:10:10

解决方案1
3 已采纳 2021-03-30 01:28:11

解决方案2
3 2021-03-30 01:29:53

解决方案3
1 2021-03-30 08:10:10