Trouble matching multiple patterns in Ruby (regex)

Question

Basically, I want to extract the number after either HEAD or POST.

irb(main):001:0> "This is HEAD and a POST".match("HEAD|POST")
=> #<MatchData "HEAD">
irb(main):002:0> "This is HEAD and a POST".match("(HEAD|POST)")
=> #<MatchData "HEAD" 1:"HEAD">
irb(main):003:0> "This is HEAD and a POST".match("[HEAD|POST]")
=> #<MatchData "T">
irb(main):004:0> "This is HEAD 1 and a POST 2".match("[HEAD|POST] (.)")
=> #<MatchData "D 1" 1:"1">
irb(main):005:0>

The last regex didn't match the "2" that is after "POST". Why? Also, why is "D 1" being matched?

Answer 1

HEAD|POST and (HEAD|POST) match the same strings (either HEAD or POST); the second one captures the matched string in a group, while the first doesn't.
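
As a small illustration of the difference between the two forms, here is a sketch using regex literals instead of the string patterns from the question. The variable names (text, plain, grouped) are made up for the example.

```ruby
text = "This is HEAD and a POST"

# Without parentheses the alternation matches, but there are no capture groups.
plain = text.match(/HEAD|POST/)
whole_plain = plain[0]        # the whole match, "HEAD"
no_groups   = plain.captures  # an empty array

# With parentheses the matched word is also available as group 1.
grouped = text.match(/(HEAD|POST)/)
whole_grouped = grouped[0]    # the whole match, "HEAD"
first_group   = grouped[1]    # the captured word, also "HEAD"
```

Both calls find HEAD first because match returns only the leftmost match in the string.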

On the other hand, [HEAD|POST] is a character class: it matches exactly one character out of H, E, A, D, |, P, O, S and T. That is why "This is HEAD 1 and a POST 2".match("[HEAD|POST] (.)") can't match the leading T, which isn't followed by a space. Instead it matches the single D at the end of HEAD, plus the space and the 1 that follow, capturing the 1. match stops at the first match, so it never reaches the 2 after POST.
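
To see the character class and the alternation side by side, a minimal sketch on the same input string (the variable names are only for illustration):

```ruby
text = "This is HEAD 1 and a POST 2"

# [HEAD|POST] is a character class: a single character from the set
# H, E, A, D, |, P, O, S, T. The first such character that is followed
# by a space and one more character is the D at the end of HEAD.
char_class = text.match(/[HEAD|POST] (.)/)
class_whole = char_class[0]   # "D 1"
class_group = char_class[1]   # "1"

# (HEAD|POST) is an alternation between two whole words.
alternation = text.match(/(HEAD|POST) (.)/)
alt_whole  = alternation[0]        # "HEAD 1"
alt_groups = alternation.captures  # ["HEAD", "1"]
```

Even with the corrected pattern, match reports only the first occurrence, so the 2 after POST still needs a method that walks the whole string.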

Answer 2

Try scan:

"This is HEAD 1 and a POST 2".scan(/(HEAD|POST)\s(\d)/)

=> [["HEAD", "1"], ["POST", "2"]]
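
If the numbers can have more than one digit, the same idea can be extended. This is only a sketch under that assumption: the sample input and the use of \s+ and \d+ are not from the original question, and the variable names are made up.

```ruby
text = "This is HEAD 10 and a POST 25"

# \d+ takes one or more digits; \s+ tolerates more than one whitespace character.
# scan returns one [word, digits] array per match, in order of appearance.
pairs = text.scan(/(HEAD|POST)\s+(\d+)/)

# Turn the pairs into a hash of integers keyed by the matched word.
numbers = pairs.map { |word, digits| [word, digits.to_i] }.to_h
# pairs   == [["HEAD", "10"], ["POST", "25"]]
# numbers == { "HEAD" => 10, "POST" => 25 }
```

Unlike match, scan keeps going after the first hit, which is why both numbers show up.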

Answer 1
4, accepted, 2012-07-11 13:19:10

Answer 2
1, 2012-07-11 13:16:14
