在python中的正则表达式匹配中连续转义字符串

Question

I met one problem when I use regex to match some string using Python. 当我使用regex使用Python匹配某些字符串时遇到了一个问题。

Example string: 示例字符串：

ln[1] --This is a string-- ln [1]-这是一个字符串-

ln[2] Match the line below. ln [2]匹配以下行。

ln[3] --This is a string-- ln [3]-这是一个字符串-

ln[4] Match this line start from here. ln [4]匹配此行，从这里开始。

ln[5] -This is the end- ln [5]-这是结局-

I want to extract abc in the string above. 我想在上面的字符串中提取abc。

code: 码：

pattern = re.compile('%s(.*?)%s' % ('--This is a string--', '-This is the end-'))
re.findall(pattern, string)

How can I get the line 4 only, not get line 2 to line 4 ? 我怎样才能只获得第4行，而不是获得第2行到第4行？

Thank you very much. 非常感谢你。

Answer 1

Probably, via this: 大概是这样的：

pattern = re.compile('.*(a.*?c)')
re.findall(pattern, string)  # yields ["abc"]

Answer 2

>>> re.findall('a[^a]*c', 'aaaaaaaaabc')
['abc']
>>> re.findall('a[^a]*c', 'aaaaaaaaa c')
['a c']

Answer 3

If you want to replace all instances of repeated characters you could use id or named groups. 如果要替换所有重复字符的实例，则可以使用id或命名组。

Example: 例：

with id: ID：

>>> re.sub('(.)(\\1)+', '\\1', 'abcAAAAabcBBBBabcCCCCabc')
'abcAabcBabcCabc'

with name: 名称：

>>> re.sub('(?P<n>.)(?P=n)+', '\\1', 'abcAAAAabcBBBBabcCCCCabc')
'abcAabcBabcCabc'

在python中的正则表达式匹配中连续转义字符串

问题描述

3 个解决方案

解决方案1
2 2013-07-10 10:07:40

解决方案2
2 2013-07-10 10:09:58

解决方案3
1 2013-07-10 10:20:00

在python中的正则表达式匹配中连续转义字符串

问题描述

3 个解决方案

解决方案1 2 2013-07-10 10:07:40

解决方案2 2 2013-07-10 10:09:58

解决方案3 1 2013-07-10 10:20:00

解决方案1
2 2013-07-10 10:07:40

解决方案2
2 2013-07-10 10:09:58

解决方案3
1 2013-07-10 10:20:00