为什么这个正则表达式不匹配完整的模式？

Question

import re

patterns = re.compile(r'(yesterday|today) \d{1,2} hours \d{1,2} minutes')

matches = re.findall(patterns, 'yesterday 9 hours 32 minutes today 10 hours 30 minutes')

print(matches)

The print output of the code above is:上面代码的打印 output 是：

['yesterday', 'today']

I hope it is:我希望它是：

['yesterday 9 hours 32 minutes', 'today 10 hours 30 minutes']

Why doesn't it match the full patterns?为什么它不匹配完整的模式？

Answer 1

You are using your initial capture group to designate the choice between yesterday and today:您正在使用初始捕获组来指定昨天和今天之间的选择：

(yesterday|today) -- grouping is a valid use of the capture group, but in this case it's having the unintended consequence of confusing you. (yesterday|today) ——分组是对捕获组的有效使用，但在这种情况下，它会产生让您感到困惑的意外后果。

You can handle this several ways.您可以通过多种方式处理此问题。 The following will get you the result you want using finditer and a reference to .group(0) which always indicates the full matched text:以下内容将使用 finditer 和对始终指示完整匹配文本的 .group .group(0)的引用为您提供所需的结果：

import re

patterns = re.compile(r'(yesterday|today) \d{1,2} hours \d{1,2} minutes')

matches = patterns.finditer('yesterday 9 hours 32 minutes today 10 hours 30 minutes')
for match in matches:
    print(match.group(0))

You could also do something like:你也可以这样做：

import re

patterns = re.compile(r'(?:yesterday|today) \d{1,2} hours \d{1,2} minutes')

matches = patterns.findall('yesterday 9 hours 32 minutes today 10 hours 30 minutes')
print(matches)

Which will convert the capture group to a non-capture group.这会将捕获组转换为非捕获组。

为什么这个正则表达式不匹配完整的模式？

问题描述

1 个解决方案

解决方案1
0 已采纳 2020-08-19 03:43:55

为什么这个正则表达式不匹配完整的模式？

问题描述

1 个解决方案

解决方案1 0 已采纳 2020-08-19 03:43:55

解决方案1
0 已采纳 2020-08-19 03:43:55