用 Python 解析文本文件？！ txt单词的独特模式

Question

我正在尝试解析来自文本文件的一系列消息，并使用 Python (2.7.3) 或任何其他 python 版本将它们保存为 txt 文件。

我有像this.txt这样的txt文件：

[#11:23][INFO][0x0015a] it's here and it's optimally required start index[1] , length[15]
INFO isn't NULL
[#12:25][PERFECT][0x0015a] process returned as NULL load index[1] , length[20] , type[0]
[#13:3][INFO][0x0015a] it's here and it's optimally required start index[1] , length[15]
PERFECT isn't NULL
[#4:23][INFO][0x0015a] it's here and it's optimally required start index[1] , length[15]
Time is here [Tick:135055] , Time:  17, index: 608, CastedType:20002, area :0
[#15:23][INFO][0x0015a] it's here and it's optimally required start index[1] , length[15]
[#16:25][PERFECT][0x0015a] process returned as NULL load index[1] , length[20] , type[0]
[#17:3][INFO][0x0015a] it's here and it's optimally required start index[1] , length[15]
[#8:23][INFO][0x0015a] it's here and it's optimally required start index[1] , length[15]
time is here [Tick:135055] , Time:  17, index: 608, CastedType:20002, area :0
[#16:23][INFO][0x0015a] it's here and it's optimally required start index[1] , length[15]
[#14:25][PERFECT][0x0015a] process returned as NULL load index[1] , length[20] , type[0]
[#18:3][INFO][0x0015a] it's here and it's optimally required start index[1] , length[15]
[#6:23][INFO][0x0015a] it's here and it's optimally required start index[1] , length[15]
Time is here [Tick:135055] , Time:  17, index: 608, CastedType:20002, area :0

这是 txt 具有的所有行的类型格式，因此每一行在给定的 txt 文件上重复，并且它有自己独特的模式，如我上面所示，其中关键字[INFO] ， [PERFECT]不会根据消息更改在此消息模式中，这些关键字值不会更改。 考虑每一行都是一条新消息，因此在每一行都有一条新消息开始。

我试图在 python 中实现的是一个 function，它逐行读取 txt 文件，并且那里的所有行都有我上面提到的这种类型的模式，并以这种特定类型转储所有行：

[#12:25][PERFECT][0x0015a] process returned as NULL load index[1] , length[20] , type[0]

到另一个txt文件。 因此，如果我将 go 转到另一个 txt 文件，我将看到那里的所有行都有这种类型的消息：

[#12:25][PERFECT][0x0015a] process returned as NULL load index[1] , length[20] , type[0]

现在，在从给定的 txt（输入 txt）中嗅出这种类型的消息后，我需要逐行读取我生成的具有特定消息类型的新 txt 文件，然后获取加载索引值并将它们转储到另一个 txt 文件中这只是负载指数的值。

所以在我上面的例子中，我会得到这样的：

给定txt文件：（这是.txt文件作为输入）

[#11:23][INFO][0x0015a] it's here and it's optimally required start index[1] , length[15]
INFO isn't NULL
[#12:25][PERFECT][0x0015a] process returned as NULL load index[1] , length[20] , type[0]
[#13:3][INFO][0x0015a] it's here and it's optimally required start index[1] , length[15]
PERFECT isn't NULL
[#4:23][INFO][0x0015a] it's here and it's optimally required start index[1] , length[15]
Time is here [Tick:135055] , Time:  17, index: 608, CastedType:20002, area :0
[#15:23][INFO][0x0015a] it's here and it's optimally required start index[1] , length[15]
[#16:25][PERFECT][0x0015a] process returned as NULL load index[1] , length[20] , type[0]
[#17:3][INFO][0x0015a] it's here and it's optimally required start index[1] , length[15]
[#8:23][INFO][0x0015a] it's here and it's optimally required start index[1] , length[15]
time is here [Tick:135055] , Time:  17, index: 608, CastedType:20002, area :0
[#16:23][INFO][0x0015a] it's here and it's optimally required start index[1] , length[15]
[#14:25][PERFECT][0x0015a] process returned as NULL load index[1] , length[20] , type[0]
[#18:3][INFO][0x0015a] it's here and it's optimally required start index[1] , length[15]
[#6:23][INFO][0x0015a] it's here and it's optimally required start index[1] , length[15]
Time is here [Tick:135055] , Time:  17, index: 608, CastedType:20002, area :0

function 的结果/输出：

生成具有我上面解释的特定模式的所有行的 txt 文件（所有具有单词[PERFECT]的行，因此生成的 txt 文件应具有所有具有[PERFECT]的消息/行：
[#12:25] [PERFECT] [0x0015a] 进程返回为 NULL 加载索引 [1]，长度 [20]，类型 [0] [#16:25] [PERFECT] [0x0015a] 进程返回为 Z6C3E226B4D4795D518AB341B0824 加载索引 [ 1], length[20], type[0] [#14:25] [PERFECT] [0x0015a] 进程返回为 NULL 加载索引[1], length[20], type[0]
然后为负载索引值生成另一个新的 txt 文件，在我的情况下，负载索引值在单词负载索引 ( load index [value] ) 的 [ ] 内找到，因此 function 应在新的 txt 文件中转储负载的值索引作为列到另一个新生成的 txt 文件中：

1 1 1

如上所述，如何在 python 中解析包含此模式和消息行的文本文件？

简而言之，我想使用上面解释的消息模式逐行（逐个消息）运行给定的txt文件，然后将所有具有关键字[PERFECT]和括号的消息解析到新的txt文件中，所以我将在新生成的 txt 文件中仅包含关键字 [PERFECT] 的消息。现在有了这个新生成的文件，它只嗅探了具有关键字 [PERFECT] 的消息，然后循环并传递这个新生成的文件中的每条消息（具有唯一模式 [PERFECT] 的嗅探消息）以获得值出现在每条消息中的负载索引 [值] 在我的情况下是 1 1 1，因为负载索引 [1] 在三条消息中显示为 1。 负载索引值应转储到另一个新的 txt 文件中，该文件以负载索引值作为列。

非常感谢您的合作！

Answer 1

def get_statuses(s, t):
    statuses = []
    for line in s.splitlines():
        if line.startswith("[#"):
            meta, content = line.split(" ", 1)
            time, status, code = meta.split("][")
            time, code = time[2:], code[:-1]
            index = re.search(r'(index\[)(\d+)(\])', content).group(2)
            if status == t:
                statuses.append({
                    'time': time, 'code': code, 'content': content, 'index': index
                })
    return statuses

它将 output：

[{'time': '12:25',
  'code': '0x0015a',
  'content': 'process returned as NULL load index[1] , length[20] , type[0]',
  'index': '1'},
 {'time': '16:25',
  'code': '0x0015a',
  'content': 'process returned as NULL load index[1] , length[20] , type[0]',
  'index': '1'},
 {'time': '14:25',
  'code': '0x0015a',
  'content': 'process returned as NULL load index[1] , length[20] , type[0]',
  'index': '1'}]

您可以将 function output 用于csv.DictWriter() 。

用 Python 解析文本文件？！ txt单词的独特模式

问题描述

1 个解决方案

解决方案1
0 2021-12-14 16:21:39

用 Python 解析文本文件？！ txt单词的独特模式

问题描述

1 个解决方案

解决方案1 0 2021-12-14 16:21:39

解决方案1
0 2021-12-14 16:21:39