在Python中使用正则表达式获取字符之前/之后的单词

Question

This is supposed to be easy using capturing groups, but I am not getting the correct words.使用捕获组应该很容易，但我没有得到正确的词。 I have been using the following:我一直在使用以下内容：

#Before
print(re.sub(r'\b([A-Za-z0-9]+)\b(?=\.?\s*(\&|\-|and))',r'\1','A. & B.',flags=re.IGNORECASE))
A. & B.

#After
print(re.sub(r'(\&|\-|and)\s*\b([A-Za-z0-9]+)\b',r'\2','A. & B.',flags=re.IGNORECASE))
A. B.

The string can be one of the following:字符串可以是以下之一：

A. - B.
A.-B.
A. & B.
A.&B.
A. AND B.

Why the capturing groups are not printing A and B in the previous examples?为什么前面例子中的捕获组没有打印A和B ？

Thanks in advance :)提前致谢：）

Answer 1

The string '\\1' is octal for the decimal value 1 or 0x01 hex.对于十进制值 1 或 0x01 十六进制，字符串'\\1'是八进制的。

>>> import re
>>> re.sub(r'\b([A-Za-z0-9]+)\b(?=\.?\s*(\&|\-|and))','\1','A. & B.',re.IGNORECASE)
'\x01. & B.'

Regex needs backreferences to be escaped.正则表达式需要转义反向引用。

Either of these replacement strings refer to capture group 1这些替换字符串中的任何一个都指的是捕获组 1
'\\\\r'

>>> import re
>>> re.sub(r'\b([A-Za-z0-9]+)\b(?=\.?\s*(\&|\-|and))','\\1','A. & B.',re.IGNORECASE)
'A. & B.'

Or,或者，

r'\\1'

>>> import re
>>> re.sub(r'\b([A-Za-z0-9]+)\b(?=\.?\s*(\&|\-|and))',r'\1','A. & B.',re.IGNORECASE)
'A. & B.'

Answer 2

Use re.search() instead and group the desired words before and after one of the options &,-,and :改用re.search()并在选项&,-,and之前和之后对所需的单词进行分组：

text = re.search('(\w+)\.+\s*[\&*\-*AND*and*]*\s*(\w+)\.+', 'A. & B.')
print (text.groups())

在Python中使用正则表达式获取字符之前/之后的单词

问题描述

2 个解决方案

解决方案1
1

解决方案2
0 2019-12-11 00:38:11

在Python中使用正则表达式获取字符之前/之后的单词

问题描述

2 个解决方案

解决方案1 1

解决方案2 0 2019-12-11 00:38:11

解决方案1
1

解决方案2
0 2019-12-11 00:38:11