正则表达式提取段落基于 2 正则表达式匹配

Question

I am working on an python automation script where I want extract specific paragraph based on regex match but I am stuck on how to extract the paragraph.我正在开发一个 python 自动化脚本，我想根据正则表达式匹配提取特定段落，但我被困在如何提取段落上。 The following is an example showing my case:以下是显示我的案例的示例：

Solution : (Consistent Pattern)解决方案：（一致模式）

The paragraph I want to extract (Inconsistent Pattern)我要提取的段落（Inconsistent Pattern）

Remote value: x (Consistent Pattern)远程值：x（一致模式）

The following is the program that I am currently working on and it will be great if anyone could enlighten me!以下是我目前正在做的程序，如果有人能指教我就太好了！

import re
test= 'Solution\s:'
test1='Remote'
with open('<filepath>', 'r') as extract:
            
            lines=extract.readlines()

            for line in lines:
                x = re.search(test, line)
                y = re.search(test1, line)
                if x is not y:
                    f4.write(line)
                    print('good')
                else:
                    print('stop')

Answer 1

This can be easily done using regular expressions, for example:这可以使用正则表达式轻松完成，例如：

import re

text = r"""
Solution\s:
The paragraph I
want to extract
Remote
Some useless text here
Solution\s:
Another paragraph
I want to
extract
Remote

"""
m = re.findall(r"Solution\\s:(.*?)Remote", text, re.DOTALL | re.IGNORECASE)
print(m)

Where text represents some text of interest (read in from a file, for example) from which we wish to extract all portions between the sentinel patterns Solution\\s: and Remote .其中text表示一些感兴趣的文本（例如从文件中读取），我们希望从中提取标记模式Solution\\s:和Remote之间的所有部分。 Here we use an IGNORECASE search so that the sentinel patterns are recognised even if spelt with different capitalization.在这里，我们使用 IGNORECASE 搜索，以便即使拼写不同的大小写也能识别哨兵模式。

The above code outputs:上面的代码输出：

['\nThe paragraph I\nwant to extract\n', '\nAnother paragraph\nI want to\nextract\n']

Read the Python re library documentation at https://docs.python.org/3/library/re.html for more details.有关更多详细信息，请阅读https://docs.python.org/3/library/re.html 上的 Python re 库文档。

正则表达式提取段落基于 2 正则表达式匹配

问题描述

1 个解决方案

解决方案1
0 已采纳 2020-10-13 08:02:24

正则表达式提取段落基于 2 正则表达式匹配

问题描述

1 个解决方案

解决方案1 0 已采纳 2020-10-13 08:02:24

解决方案1
0 已采纳 2020-10-13 08:02:24