Python正則表達式是否支持像Perl的\\ G？

Question

我有一個Perl正則表達式（在這里顯示，雖然理解整個事情不是必須回答這個問題）包含\\ G元字符。 我想將它翻譯成Python，但Python似乎不支持\\ G. 我能做什么？

Answer 1

試試這些：

import re
re.sub()
re.findall()
re.finditer()

例如：

# Finds all words of length 3 or 4
s = "the quick brown fox jumped over the lazy dogs."
print re.findall(r'\b\w{3,4}\b', s)

# prints ['the','fox','over','the','lazy','dogs']

Answer 2

我知道我遲到了，但這里是\\G方法的替代品：

import re

def replace(match):
    if match.group(0)[0] == '/': return match.group(0)
    else: return '<' + match.group(0) + '>'

source = '''http://a.com http://b.com
//http://etc.'''

pattern = re.compile(r'(?m)^//.*$|http://\S+')
result = re.sub(pattern, replace, source)
print(result)

輸出（通過Ideone ）：

<http://a.com> <http://b.com>
//http://etc.

我們的想法是使用匹配兩種字符串的正則表達式：URL或注釋行。 然后使用回調（委托，閉包，嵌入代碼等）來找出匹配的那個並返回相應的替換字符串。

事實上，這是我的首選方法，即使是支持\\G口味。 即使在Java中，我也必須編寫一堆樣板代碼來實現回調。

（我不是一個Python人，所以請原諒我，如果代碼是非常pythonic。）

Answer 3

您可以使用re.match匹配錨定模式。 re.match只會在文本的開頭（位置0）或您指定的位置匹配。

def match_sequence(pattern,text,pos=0):
  pat = re.compile(pattern)
  match = pat.match(text,pos)
  while match:
    yield match
    if match.end() == pos:
      break # infinite loop otherwise
    pos = match.end()
    match = pat.match(text,pos)

這只會匹配給定位置的模式，以及之后跟隨0個字符的任何匹配。

>>> for match in match_sequence(r'[^\W\d]+|\d+',"he11o world!"):
...   print match.group()
...
he
11
o

Answer 4

Python的regexen沒有/ g修飾符，因此沒有\\ G regex令牌。 可惜，真的。

Answer 5

不要試圖將所有內容都放在一個表達式中，因為它變得非常難以閱讀，翻譯（如您自己所見）和維護。

import re
lines = [re.sub(r'http://[^\s]+', r'<\g<0>>', line) for line in text_block.splitlines() if not line.startedwith('//')]
print '\n'.join(lines)

從字面上翻譯Perl時，Python通常不是最好的，它有自己的編程模式。

Python正則表達式是否支持像Perl的\\ G？

問題描述

5 個解決方案

解決方案1
4 2009-02-09 20:42:33

解決方案2
2 2010-08-27 01:21:03

解決方案3
2 2009-02-09 21:05:24

解決方案4
2 2009-02-10 01:03:42

解決方案5
0 2009-02-10 06:13:33

Python正則表達式是否支持像Perl的\\ G？

問題描述

5 個解決方案

解決方案1 4 2009-02-09 20:42:33

解決方案2 2 2010-08-27 01:21:03

解決方案3 2 2009-02-09 21:05:24

解決方案4 2 2009-02-10 01:03:42

解決方案5 0 2009-02-10 06:13:33

解決方案1
4 2009-02-09 20:42:33

解決方案2
2 2010-08-27 01:21:03

解決方案3
2 2009-02-09 21:05:24

解決方案4
2 2009-02-10 01:03:42

解決方案5
0 2009-02-10 06:13:33