Python regular expression across multiple lines (multiline) that depends on the first line

Question

I have a log (a txt file) with the following structure.

At 2020-07-15 14:05:18 - Markers detected in this frame : 3 | 6 | 
ID :6 out of compartment G2A44

or

At 2020-07-15 14:05:47 - Markers detected in this frame : 3 | 0 | 9 | 
ID :9 out of compartment G2A13
ID :9 out of compartment G2A45

See the regular expression.

The information I need:

2020-07-15 (group 1)
14:05:47 (group 2)
ID:9 (group 4)
G2A13...

As long as the header line At 2020-07-15 14:05:47 - Markers detected in this frame : 3 | 0 | 9 | is followed by a single ID line, everything is matched by the expression expr = 'At ([0-9]{4}-[0-9]{2}-[0-9]{2}) ([0-9]{2}:[0-9]{2}:[0-9]{2}) - Markers detected in this frame : ([0-9]{1,} .{1,})\s(ID..[0-9])\sout of compartment ([\w]{4,})'.

But how do I capture the second or third ID line, with the same groups, in the regular expression?

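import re

# Matches the header line plus only the first "ID ..." line that follows it.
# A raw string keeps \s and \w from being treated as string escapes.
expr = r'At ([0-9]{4}-[0-9]{2}-[0-9]{2}) ([0-9]{2}:[0-9]{2}:[0-9]{2}) - Markers detected in this frame : ([0-9]{1,} .{1,})\s(ID..[0-9])\sout of compartment ([\w]{4,})'
f = 'XX.txt'
with open(f, 'r') as file:
    text = file.read()
# re.MULTILINE is dropped: it only affects ^ and $, which the pattern does not use.
m = re.findall(expr, text)
print(m)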

Answer 1

What you are asking for is a parser. You need a state machine.

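Test each line against the header expression and, if it matches, store some values. If it fails that test, test the line against the next expression, and do something with the new match and the stored values.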

Don't expect to get all the lines in one go. This is a two-stage job.
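A minimal sketch of that two-stage idea, in case it helps. The names HEADER, DETAIL and parse_log are made up for illustration, and the two patterns are my own simplification of the expression in the question (for instance, the ID is captured as one or more digits rather than a single character), so adjust them to the real log.

```python
import re

# Stage 1 pattern (assumed): header line with date, time and the marker list.
HEADER = re.compile(
    r'At (\d{4}-\d{2}-\d{2}) (\d{2}:\d{2}:\d{2}) - '
    r'Markers detected in this frame : (.*)'
)
# Stage 2 pattern (assumed): detail line with marker ID and compartment name.
DETAIL = re.compile(r'ID :(\d+) out of compartment (\w+)')


def parse_log(lines):
    """Yield (date, time, marker_id, compartment) for every ID line."""
    current = None  # state: (date, time) taken from the most recent header
    for line in lines:
        header = HEADER.match(line)
        if header:
            # Header line: remember its values for the ID lines that follow.
            current = (header.group(1), header.group(2))
            continue
        detail = DETAIL.match(line)
        if detail and current is not None:
            # ID line: combine the stored header values with the new match.
            yield (*current, detail.group(1), detail.group(2))


sample = """\
At 2020-07-15 14:05:18 - Markers detected in this frame : 3 | 6 |
ID :6 out of compartment G2A44
At 2020-07-15 14:05:47 - Markers detected in this frame : 3 | 0 | 9 |
ID :9 out of compartment G2A13
ID :9 out of compartment G2A45
"""

records = list(parse_log(sample.splitlines()))
for record in records:
    print(record)
```

For the sample this gives three records, and the last two share the 14:05:47 header. To read the real file, pass the open file object to parse_log instead of sample.splitlines(), since iterating over a file yields one line at a time.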

Solution 1
0 2020-07-19 18:15:18
