从文本文件python中获取一系列单词

Question

I have a text file and my goal is to generate an output file with all the words that are between two specific words. 我有一个文本文件，我的目标是生成一个输出文件，其中包含两个特定单词之间的所有单词。

For example, if I have this text: 例如，如果我有这个文字：

askdfghj... Hello world my name is Alex and I am 18 years all ...askdfgj.

And I want to obtain all the words between "my" and "Alex". 我想获得“我的”和“亚历克斯”之间的所有词语。

Output: 输出：

my name is Alex

I have it in mind... but I don't know how to create the range: 我记得了......但我不知道如何创建范围：

if 'my' in open(out).read():
        with open('results.txt', 'w') as f:
            if 'Title' in open(out).read():
                f.write('*')
        break

I want an output file with the sentence "my name is Alex". 我想要一个带有句子“我的名字是Alex”的输出文件。

Answer 1

You can use regex here: 你可以在这里使用regex ：

>>> import re
>>> s = "askdfghj... Hello world my name is Alex and I am 18 years all ...askdfgj."
>>> re.search(r'my.*Alex', s).group()
'my name is Alex'

If string contains multiple Alex after my and you want only the shortest match then use .*? 如果字符串在my之后包含多个Alex并且您只想要最短的匹配，那么使用.*? : ：

With ? 用? : ：

>>> s = "my name is Alex and you're Alex too."
>>> re.search(r'my.*?Alex', s).group()
'my name is Alex'

Without ? 没有? : ：

>>> re.search(r'my.*Alex', s).group()
"my name is Alex and you're Alex"

Code: 码：

with open('infile') as f1, open('outfile', 'w') as f2:
    data = f1.read()
    match = re.search(r'my.*Alex', data, re.DOTALL)
    if match:
        f2.write(match.group())

Answer 2

You can use the regular expression my.*Alex 你可以使用正则表达式my.*Alex

data = "askdfghj... Hello world my name is Alex and I am 18 years all ...askdfgj"
import re
print re.search("my.*Alex", data).group()

Output 产量

my name is Alex

从文本文件python中获取一系列单词

问题描述

2 个解决方案

解决方案1
2 已采纳 2013-11-09 14:56:35

解决方案2
0 2013-11-09 14:56:40

从文本文件python中获取一系列单词

问题描述

2 个解决方案

解决方案1 2 已采纳 2013-11-09 14:56:35

解决方案2 0 2013-11-09 14:56:40

解决方案1
2 已采纳 2013-11-09 14:56:35

解决方案2
0 2013-11-09 14:56:40