根据重复模式分割字串

Question

I have the following sting: 我有以下问题：

a='text1fig [1]text2fig [15]text3fig [234]text4fig [2234]text5'

I want to split it to: 我想将其拆分为：

texts=['text1','text2','text3','text4','text5']

I tried: 我试过了：

import re    
texts=re.split('fig \\[[0-3000]\\]',texts) .

but it doesn't work. 但这不起作用。
thank you 谢谢

Answer 1

Number ranges don't work like that. 数字范围不是那样的。 Just use \\d instead. 只需使用\\d 。 Additionally, you'll want a single backslash, as a double backslash is taken to be a literal backslash (you want it to escape the [ / ] metachars instead). 另外，您将需要一个反斜杠，因为将双反斜杠视为文字反斜杠（您希望它转义[ / ]元字符）。

text = re.split(r'fig\s*\[\d+\]', a)

print(text)
['text1', 'text2', 'text3', 'text4', 'text5']

Regex Details 正则表达式详细信息

fig   
\s*   # 0 or more whitespace chars 
\[    # literal opening brace
\d+   # 1 or more digits
\]    # literal closing brace

Answer 2

You want to use "re.findall" instead of "re.split" 您要使用“ re.findall”而不是“ re.split”

texts = re.findall('text\d', a)

['text1', 'text2', 'text3', 'text4', 'text5']

根据重复模式分割字串

问题描述

2 个解决方案

解决方案1
3 已采纳 2017-09-15 13:01:04

解决方案2
0 2017-09-15 13:24:33

根据重复模式分割字串

问题描述

2 个解决方案

解决方案1 3 已采纳 2017-09-15 13:01:04

解决方案2 0 2017-09-15 13:24:33

解决方案1
3 已采纳 2017-09-15 13:01:04

解决方案2
0 2017-09-15 13:24:33