Regex to get all the digits in a string after a given character

Question

I'm trying to parse the following string and return all the digits that come after the last opening square bracket:

C9: Title of object (foo, bar) [ch1, CH12,c03,4]

So the result should be:

1,12,03,4

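The string and the digits will change. The important thing is to get the digits after the '[', regardless of which characters (if any) precede them. (I need this in Python, so no atomic groups either!) I've tried everything I can think of, including: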

 \[.*?(\d) = matches '1' only
 \[.*(\d) = matches '4' only
 \[*?(\d) = the matches include the '9' from the beginning

and so on.
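For reference, here is a small sketch of why those three attempts behave as described. It assumes they were run with `re.search` and `re.findall` on the sample string; the question does not say which function was actually used.

```python
import re

s = 'C9: Title of object (foo, bar) [ch1, CH12,c03,4]'

# Lazy .*? stops as soon as the first digit after '[' is reachable.
lazy = re.search(r'\[.*?(\d)', s).group(1)

# Greedy .* runs to the end of the string and backtracks to the last digit.
greedy = re.search(r'\[.*(\d)', s).group(1)

# \[*? makes the bracket itself optional (zero or more '['), so every
# single digit matches, including the '9' in front of the bracket.
optional_bracket = re.findall(r'\[*?(\d)', s)

print(lazy, greedy, optional_bracket)
```

In all three cases the capture group holds a single `\d`, so at most one digit is captured per match.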

Any help is greatly appreciated!

EDIT: I also need to do this without using str.split().

Answer 1

You are better off finding all the digits in the substring after the last [ bracket:

>>> import re
>>> s = 'C9: Title of object (fo[ 123o, bar) [ch1, CH12,c03,4]'
>>> # Get the substring after the last '['.
>>> target_string = s.rsplit('[', 1)[1]
>>>
>>> re.findall(r'\d+', target_string)
['1', '12', '03', '4']
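One thing to watch with the snippet above: `s.rsplit('[', 1)[1]` raises `IndexError` when the string contains no `[` at all. A minimal sketch of a wrapper that tolerates that case follows. The helper name `digits_after_last_bracket` and the choice to return an empty list are assumptions for illustration, not part of the original answer.

```python
import re

def digits_after_last_bracket(s):
    """Return the digit runs found after the last '[' in s (hypothetical helper)."""
    # rpartition returns ('', '', s) when '[' is absent, so sep is empty then.
    _, sep, tail = s.rpartition('[')
    if not sep:
        return []  # assumption: no bracket means there is nothing to extract
    return re.findall(r'\d+', tail)

print(digits_after_last_bracket('C9: Title of object (fo[ 123o, bar) [ch1, CH12,c03,4]'))
```

`str.rpartition` is used here instead of `rsplit` only because it makes the missing-bracket case explicit.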

If you can't use split, then this one works, using a look-ahead assertion:

>>> s = 'C9: Title of object (fo[ 123o, bar) [ch1, CH12,c03,4]'
>>> re.findall(r'\d+(?=[^[]+$)', s)
['1', '12', '03', '4']

This finds every run of digits that is followed only by non-[ characters up to the end of the string.
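Note that `[^[]+` requires at least one character after the number, which the closing `]` provides in the sample. If the input could end directly in a digit, a variant with `*` may be safer. The sketch below is an assumption-laden variation, not the answer's original pattern. The names `TAIL_DIGITS` and `digits_after_last_bracket_re` are made up for illustration, and the bracket is escaped inside the character class purely for readability.

```python
import re

# '*' instead of '+' lets the look-ahead succeed with nothing left,
# so a number at the very end of the string is still matched whole.
TAIL_DIGITS = re.compile(r'\d+(?=[^\[]*$)')

def digits_after_last_bracket_re(s):
    """Return digit runs that have no '[' anywhere after them (hypothetical helper)."""
    return TAIL_DIGITS.findall(s)

print(digits_after_last_bracket_re('C9: Title of object (fo[ 123o, bar) [ch1, CH12,c03,4]'))
print(digits_after_last_bracket_re('[a1, 42'))
```

With either pattern, a string that contains no `[` at all yields its digits anyway, because the look-ahead only checks that no `[` follows.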

Answer 2

Using the non-greedy ? might help. For example:

\[.*?(\d*?),.*?(\d*?),.*?(\d*?),.*?(\d*?)\]

And here is how it works (from https://regex101.com/r/jP7hM3/1):

"\[.*?(\d*?),.*?(\d*?),.*?(\d*?),.*?(\d*?)\]"
\[ matches the character [ literally
.*? matches any character (except newline)
Quantifier: *? Between zero and unlimited times, as few times as possible, expanding as needed [lazy]
1st Capturing group (\d*?)
\d*? match a digit [0-9]
Quantifier: *? Between zero and unlimited times, as few times as possible, expanding as needed [lazy]
, matches the character , literally
.*? matches any character (except newline)
Quantifier: *? Between zero and unlimited times, as few times as possible, expanding as needed [lazy]
2nd Capturing group (\d*?)
\d*? match a digit [0-9]
Quantifier: *? Between zero and unlimited times, as few times as possible, expanding as needed [lazy]
, matches the character , literally
.*? matches any character (except newline)
Quantifier: *? Between zero and unlimited times, as few times as possible, expanding as needed [lazy]
3rd Capturing group (\d*?)
\d*? match a digit [0-9]
Quantifier: *? Between zero and unlimited times, as few times as possible, expanding as needed [lazy]
, matches the character , literally
.*? matches any character (except newline)
Quantifier: *? Between zero and unlimited times, as few times as possible, expanding as needed [lazy]
4th Capturing group (\d*?)
\d*? match a digit [0-9]
Quantifier: *? Between zero and unlimited times, as few times as possible, expanding as needed [lazy]
\] matches the character ] literally

Though, I have to agree with the others... this is a regex solution, but it's not a very pythonic one.
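As a rough sketch of how that pattern could be applied in Python (the answer itself only links to regex101; the variable names here are illustrative):

```python
import re

# Four hard-coded capture groups, one per comma-separated item.
PATTERN = re.compile(r'\[.*?(\d*?),.*?(\d*?),.*?(\d*?),.*?(\d*?)\]')

s = 'C9: Title of object (foo, bar) [ch1, CH12,c03,4]'
m = PATTERN.search(s)

# groups() returns one string per capture group, or we fall back to ().
groups = m.groups() if m else ()
print(groups)
```

Because the pattern hard-codes four groups and anchors on the first `[` it finds, it only fits inputs shaped like the sample: exactly four comma-separated items and no earlier `[` in the string.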

Solution 1
5 2015-12-17 15:45:10

Solution 2
0 2015-12-17 15:46:24
