在两个单词之间拆分字符串

Question

下面的字符串

text = 'FortyGigE1/0/53\r\nCurrent state: DOWN\r\nLine protocol state: DOWN\r\n\r\nFortyGigE1/0/54\r\nCurrent state: DOWN\r\nLine protocol state: DOWN\r\n\r\n'

应该拆分成这样：

output = [
    'FortyGigE1/0/53\r\nCurrent state: DOWN\r\nLine protocol state: DOWN\r\n\r\n',
    'FortyGigE1/0/54\r\nCurrent state: DOWN\r\nLine protocol state: DOWN\r\n\r\n'
]

拆分后不应删除分隔符。

delimiters = '(GigabitEthernet\d*/\d*/\d*\s.*|FortyGigE\d*/\d*/\d*\s.*)'

我试图这样做：

output = re.split(delimiters, text)

但我的输出将是这样的，比我预期的要多得多：

['',
 'FortyGigE1/0/53\r', '\nCurrent state: DOWN\r\nLine protocol state: DOWN\r\n\r\n',
 'FortyGigE1/0/54\r', '\nCurrent state: DOWN\r\nLine protocol state: DOWN\r\n\r\n']

Answer 1

至少在您的示例中，您可以执行以下操作：

>>> re.split(r'(?<=DOWN\r\n\r\n)(?=FortyGigE)', text)
['FortyGigE1/0/53\r\nCurrent state: DOWN\r\nLine protocol state: DOWN\r\n\r\n',
 'FortyGigE1/0/54\r\nCurrent state: DOWN\r\nLine protocol state: DOWN\r\n\r\n']

与您声明的所需输出相比：

>>> output==re.split(r'(?<=DOWN\r\n\r\n)(?=FortyGigE)', text)
True

它通过使用零宽度回顾(?<=DOWN\\r\\n\\r\\n)和零宽度(?=FortyGigE)作为拆分点来工作。

这是一个 regex101 演示； \\r被删除，因为它们在该平台上不受支持。

Answer 2

你的提示给了我解决我的问题的方法。 这是我的脚本的摘录：

f = open(file, "r")
content = f.read()
f.close()
#
# This deliminator is only an example. The interface names are much longer
deliminators = r'(?=\nBridge-Aggregation|\nHundredGigE|\nFortyGigE|\nTen-GigabitEthernet)'
#
dev_interfaces = re.split(deliminators, content)
max_interfaces = len(dev_interfaces)
# Delete the beginning Linefeed (\n) of each interface
dev_interfaces[index] = dev_interfaces[index].lstrip('\n')

在两个单词之间拆分字符串

问题描述

2 个解决方案

解决方案1
0 2020-08-25 14:38:12

解决方案2
0 2020-09-03 14:50:17

在两个单词之间拆分字符串

问题描述

2 个解决方案

解决方案1 0 2020-08-25 14:38:12

解决方案2 0 2020-09-03 14:50:17

解决方案1
0 2020-08-25 14:38:12

解决方案2
0 2020-09-03 14:50:17