Finding all occurrences of a given substring in a string using regular expressions in Python

Question

I am trying to find all occurrences of a substring in a string, as shown below:

import re

S = 'aaadaa'
matches = re.finditer('(aa)', S)
if matches:
    # print(matches)
    for match in matches:
        print(match)
else:
    print("No match")

The current output is:

<re.Match object; span=(0, 2), match='aa'>
<re.Match object; span=(4, 6), match='aa'>

But I expected it to give the following:

<re.Match object; span=(0, 2), match='aa'>
<re.Match object; span=(1, 3), match='aa'>
<re.Match object; span=(4, 6), match='aa'>

Can someone help me?

Answer 1

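Taken from the answer I linked in the comments, this is the pattern you need: (?=(aa)).

The lookahead itself matches an empty string, so you have to read the sub-match from the capture group with match_obj.group(1) and get its indices with match_obj.span(1).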
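As a minimal sketch of how that pattern could be used with the string from the question: the helper name `overlapping_spans` is made up for illustration, and wrapping the substring in `re.escape` is an added assumption so that it is treated as literal text.

```python
import re

def overlapping_spans(sub, text):
    """Return (start, end) spans of every occurrence of `sub`, overlaps included.

    Hypothetical helper, not part of the original answer.
    """
    # The lookahead (?=...) is zero-width, so the scanner only moves forward
    # one character per match instead of skipping past the whole substring.
    pattern = re.compile(f"(?=({re.escape(sub)}))")
    # Group 0 is the empty string; the real text and indices live in group 1.
    return [m.span(1) for m in pattern.finditer(text)]

for span in overlapping_spans("aa", "aaadaa"):
    print(span)
```

For 'aaadaa' this yields the spans (0, 2), (1, 3) and (4, 6), which are the three matches the question asks for.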

Answer 2

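The problem here is that once the re module matches the double aa, it also consumes both letters, so the next search starts after them. You, however, want overlapping matches. One trick you can use here is to search for a(?=a), which matches a single a only when another a follows it: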

import re

S = 'aaadaa'
# The lookahead (?=a) checks the next letter without consuming it
matches = re.findall(r'a(?=a)', S)
matches = [s + "a" for s in matches]
print(matches)

['aa', 'aa', 'aa']

Note that we tack the second a onto each item in the output list, because at each step only the first letter is actually matched.
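If the positions are needed rather than just the text, the same trick might be adapted along these lines. This is only a sketch: the function name `spans_of_double_a` is hypothetical, and the end index is computed by hand because the regex match covers just the first letter.

```python
import re

def spans_of_double_a(text):
    """Return (start, end) spans of every 'aa', overlaps included.

    Hypothetical helper built on the a(?=a) pattern from this answer.
    """
    # Each match is one 'a' that is followed by another 'a', so the full
    # two-letter occurrence runs from m.start() to m.start() + 2.
    return [(m.start(), m.start() + 2) for m in re.finditer(r"a(?=a)", text)]

print(spans_of_double_a("aaadaa"))
```

Unlike the capture-group version, this relies on knowing the substring length in advance, so it is less general but avoids reading from a group.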
