Python Regex-如何獲取匹配項的位置和值

Question

如何使用re模塊獲取所有比賽的開始和結束位置？ 例如，給定模式r'[az]'和字符串'a1b2c3d4'我想獲取找到每個字母的位置。 理想情況下，我也想找回比賽的文字。

Answer 1

import re
p = re.compile("[a-z]")
for m in p.finditer('a1b2c3d4'):
    print(m.start(), m.group())

Answer 2

取自

正則表達式操作方法

span（）在單個元組中返回起始索引和結束索引。 由於match方法僅檢查RE是否在字符串開頭匹配，因此start（）始終為零。 但是，RegexObject實例的搜索方法將掃描字符串，因此在這種情況下，匹配可能不會從零開始。

>>> p = re.compile('[a-z]+')
>>> print p.match('::: message')
None
>>> m = p.search('::: message') ; print m
<re.MatchObject instance at 80c9650>
>>> m.group()
'message'
>>> m.span()
(4, 11)

結合使用：

在Python 2.2中，finditer（）方法也可用，它返回一個MatchObject實例序列作為迭代器。

>>> p = re.compile( ... )
>>> iterator = p.finditer('12 drummers drumming, 11 ... 10 ...')
>>> iterator
<callable-iterator object at 0x401833ac>
>>> for match in iterator:
...     print match.span()
...
(0, 2)
(22, 24)
(29, 31)

您應該能夠按以下順序進行操作

for match in re.finditer(r'[a-z]', 'a1b2c3d4'):
   print match.span()

Answer 3

對於Python 3.x

from re import finditer
for match in finditer("pattern", "string"):
    print(match.span(), match.group())

對於字符串中的每個匹配，您將獲得\\n分隔的元組（分別包含匹配的第一個和最后一個索引）和匹配本身。

Answer 4

請注意，跨度和組在正則表達式中被索引為多個捕獲組

regex_with_3_groups=r"([a-z])([0-9]+)([A-Z])"
for match in re.finditer(regex_with_3_groups, string):
    for idx in range(0, 4):
        print(match.span(idx), match.group(idx))

Python Regex-如何獲取匹配項的位置和值

問題描述

4 個解決方案

解決方案1
123 已采納 2008-10-30 14:15:39

解決方案2
47 2008-10-30 14:16:02

解決方案3
17 2017-07-05 13:08:18

解決方案4
0 2019-07-23 15:22:57

Python Regex-如何獲取匹配項的位置和值

問題描述

4 個解決方案

解決方案1 123 已采納 2008-10-30 14:15:39

解決方案2 47 2008-10-30 14:16:02

解決方案3 17 2017-07-05 13:08:18

解決方案4 0 2019-07-23 15:22:57

解決方案1
123 已采納 2008-10-30 14:15:39

解決方案2
47 2008-10-30 14:16:02

解決方案3
17 2017-07-05 13:08:18

解決方案4
0 2019-07-23 15:22:57