简体繁体 English

如何使用正则表达式python提取多行文本

[英]How to extract multiline text using regex python

原文 2011-07-28 10:04:58 1 1 python/ regex/ multiline

Hi I have the following text. 嗨，我有以下文字。

x = """Hello, this is a\\nmultiline text\\nend.Hello, this is\\nthe second chunck\\nend.""" x =“”“您好，这是一个\\ n多行文字\\ nend。您好，这是\\ n第二个分块\\ nend。”“”

This pattern of Hello, \\nend. 您好，\\ nend的这种模式。 keeps on repeating. 不断重复。 I want to extract the text between each set of these two words. 我想在这两个单词的每组之间提取文本。 I tried using this 我尝试使用这个

b=re.search(r'(?<=Hello,).+(?=end)', x, re.DOTALL) b = re.search（r'（？<= Hello，）。+（？= end）'，x，re.DOTALL）

but I get all the text from the start to the end. 但我从头到尾都得到了所有文字。 How do I get the separate chunks of text? 如何获得单独的文本块？

Thanks.p Thanks.p

1 个解决方案

Use a lazy quantifier : .+? 使用惰性的量词 ： .+? instead of .+ . 而不是.+ 。

The problem is that the .+ matches as far as it can, so just eats all the way to the end of the documents. 问题是.+尽可能匹配，因此一直吃到文档末尾。 Adding the question mark tells it to match as little as it can. 添加问号会告诉它尽可能少地匹配。

如何在python中使用正则表达式在换行符处提取文本？ - How to extract text at newline using regex in python?

Python 多行正则表达式在每个时间戳后提取文本 - Python multiline regex extract text after every timestamp

Python中的正则表达式：从文本中提取具有重复相似版本的多行部分 - Regex in Python: extract a multiline part from a text with repeating similar editions

Python正则表达式提取字符串多行 - Python regex extract string multiline

从文本到文本MULTILINE的Python多行正则表达式 - Python multiline regex from text to text MULTILINE

多行正则表达式：如何在熊猫数据框中的日期之间提取文本？ - Multiline regex: How to extract text between dates in pandas dataframe?

Python正则表达式匹配多行文本 - Python regex match multiline text

在python中使用正则表达式多行提取两个子字符串之间的文本 - Extract text between two substrings using regular expression multiline in python

Python正则表达式从多行字符串中提取单词 - Python Regex to Extract Word From Multiline String

如何使用Python正则表达式通过不包含替代文本来提取子字符串 - How to extract substring by not including alternate text using Python regex

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 如何在python中使用正则表达式在换行符处提取文本？ - How to extract text at newline using regex in python? Python 多行正则表达式在每个时间戳后提取文本 - Python multiline regex extract text after every timestamp Python中的正则表达式：从文本中提取具有重复相似版本的多行部分 - Regex in Python: extract a multiline part from a text with repeating similar editions Python正则表达式提取字符串多行 - Python regex extract string multiline 从文本到文本MULTILINE的Python多行正则表达式 - Python multiline regex from text to text MULTILINE 多行正则表达式：如何在熊猫数据框中的日期之间提取文本？ - Multiline regex: How to extract text between dates in pandas dataframe? Python正则表达式匹配多行文本 - Python regex match multiline text 在python中使用正则表达式多行提取两个子字符串之间的文本 - Extract text between two substrings using regular expression multiline in python Python正则表达式从多行字符串中提取单词 - Python Regex to Extract Word From Multiline String 如何使用Python正则表达式通过不包含替代文本来提取子字符串 - How to extract substring by not including alternate text using Python regex

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM