Python 3正则表达式最后一场比赛

Question

如何使用Python 3 regex模块获取以下字符串的123部分？

....XX (a lot of HTML characters)123

这里...部分表示由HTML字符，单词和数字组成的长字符串。

数字123是XX的特征。 因此，如果有人可以建议一种通用的方法，其中XX可以是任何字母，如AA或AB ，它会更有帮助。

边注：
我想通过首先在字符串中识别XX然后识别出现在XX之后的第一个数字来使用Perl的\\G运算符。 但似乎\\G运算符在Python 3中不起作用。

我的代码：

import re
source='abcd XX blah blah 123 more blah blah'
grade=str(input('Which grade?'))
#here the user inputs XX

match=re.search(grade,source)
match=re.search('\G\D+',source)
#Trying to use the \G operator to get the location of last match.Doesn't work.

match=re.search('\G\d+',source)
#Trying to get the next number after XX.
print(match.group())

Answer 1

描述

此正则表达式将匹配可以用用户输入替换的字符串值XX 。 正则表达式还要求XX字符串被空格包围或在示例文本的开头处，以防止在像EXXON这样的单词内找到XX的意外边缘情况。

(?<=\\s|^)\\b(xx)\\b\\s.*?\\s\\b(\\d+)\\b(?=\\s|$)

在此输入图像描述

代码示例：

我不太了解python以提供适当的python示例，因此我将包含一个PHP示例来简单地显示正则表达式如何工作以及捕获的组

<?php
$sourcestring="EXXON abcd XX blah blah 123 more blah blah";
preg_match('/(?<=\s|^)\b(xx)\b\s.*?\s\b(\d+)\b(?=\s|$)/im',$sourcestring,$matches);
echo "<pre>".print_r($matches,true);
?>

$matches Array:
(
    [0] => XX blah blah 123
    [1] => XX
    [2] => 123
)

如果你需要实际的字符串位置，那么在PHP中看起来就像

$position = strpos($sourcestring, $matches[0])

Python 3正则表达式最后一场比赛

问题描述

1 个解决方案

解决方案1
1 2013-06-08 15:35:38

描述

代码示例：

Python 3正则表达式最后一场比赛

问题描述

1 个解决方案

解决方案1 1 2013-06-08 15:35:38

描述

代码示例：

解决方案1
1 2013-06-08 15:35:38