正则表达式错误 - 无需重复

Question

使用此表达式时收到错误消息：

re.sub(r"([^\s\w])(\s*\1)+","\\1","...")

我在RegExr检查了正则表达式，它返回. 正如预期的那样。 但是当我在 Python 中尝试时，我收到此错误消息：

raise error, v # invalid expression
sre_constants.error: nothing to repeat

有人可以解释一下吗？

Answer 1

这似乎是一个 python 错误（在 vim 中完美运行）。 问题的根源是 (\\s*...)+ 位。 基本上，你不能做(\\s*)+有意义的，因为你试图重复一些可以为空的东西。

>>> re.compile(r"(\s*)+")
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/System/Library/Frameworks/Python.framework/Versions/2.5/lib/python2.5/re.py", line 180, in compile
    return _compile(pattern, flags)
  File "/System/Library/Frameworks/Python.framework/Versions/2.5/lib/python2.5/re.py", line 233, in _compile
    raise error, v # invalid expression
sre_constants.error: nothing to repeat

然而(\\s*\\1)不应该为空，但我们知道它只是因为我们知道 \\1 中的内容。 显然 python 没有......这很奇怪。

Answer 2

这是“*”和特殊字符之间的 Python 错误。

而不是

re.compile(r"\w*")

尝试：

re.compile(r"[a-zA-Z0-9]*")

它可以工作，但是不会生成相同的正则表达式。

此错误似乎已在 2.7.5 和 2.7.6 之间修复。

Answer 3

实际上，这不仅是带有 * 的 Python 错误，当您将字符串作为要编译的正则表达式的一部分传递时，也会发生这种情况，例如 ;

import re
input_line = "string from any input source"
processed_line= "text to be edited with {}".format(input_line)
target = "text to be searched"
re.search(processed_line, target)

例如，如果处理的行包含一些“（+）”，这将导致错误，就像您可以在化学式或这样的字符链中找到的那样。 解决办法是逃跑，但是当你在飞行中逃跑时，可能会发生你未能正确完成的情况......

Answer 4

正则表达式在语言理论中通常使用 * 和 +。 我在执行行代码时遇到了同样的错误

re.split("*",text)

要解决它，它需要在 * 和 + 之前包含 \\

re.split("\*",text)

Answer 5

除了发现和修复的错误之外，我会注意到错误消息sre_constants.error: nothing to repeat有点令人困惑。 我试图使用r'?.*'作为模式，并认为它出于某种奇怪的原因抱怨* ，但问题实际上是? 是一种说法“重复零次或一次”。 所以我需要说r'\\?.*'来匹配文字?

Answer 6

我在使用正则表达式\\b?时遇到了这个问题\\b? . 使用\\s? 修复了问题（虽然不是一回事）

正则表达式错误 - 无需重复

问题描述

6 个解决方案

解决方案1
52 已采纳 2010-09-09 09:42:23

解决方案2
19 2011-10-24 08:24:27

解决方案3
9 2017-06-20 15:52:21

解决方案4
7 2019-12-22 00:47:40

解决方案5
6 2017-10-05 19:22:32

解决方案6
0 2021-12-21 23:21:18

正则表达式错误 - 无需重复

问题描述

6 个解决方案

解决方案1 52 已采纳 2010-09-09 09:42:23

解决方案2 19 2011-10-24 08:24:27

解决方案3 9 2017-06-20 15:52:21

解决方案4 7 2019-12-22 00:47:40

解决方案5 6 2017-10-05 19:22:32

解决方案6 0 2021-12-21 23:21:18

解决方案1
52 已采纳 2010-09-09 09:42:23

解决方案2
19 2011-10-24 08:24:27

解决方案3
9 2017-06-20 15:52:21

解决方案4
7 2019-12-22 00:47:40

解决方案5
6 2017-10-05 19:22:32

解决方案6
0 2021-12-21 23:21:18