在一个单词之前和之后输出字符，然后写入文件

Question

Im trying to get this code to find say 30 words before a specific word and 30 words after. 我正在尝试获取此代码以在特定单词之前找到30个单词，之后找到30个单词。 then i want it to writ'e my output to a new file. 然后我希望它将我的输出写入新文件。 i cant seem to figure out what i'm doing wrong as im pretty new to python. 我似乎无法弄清楚我在做什么错，因为我是python的新手。 any suggestions are more than welcome. 任何建议都值得欢迎。

def extract_text(file_name, to_find):
    file_in = open('School.txt', 'r')

    all_lines = file_in.readlines()
    file_in.close()

    new_text = all_text.replace ('\n',  '|')

    width = 30



to_find = 'boy'
new_text = all_text.replace ('\n',  '|')
while new_text.find(to_find) != -1:
    start = all_text.find(to_find)
    begin = start - width
    end = start + len(to_find) + width



    print(new_text[begin:end])
    out_put = new_text[begin:end]

    f = open("School_boy.txt","w")
    f.write(out_put)

    f.close()

Answer 1

For text parsing, I would recommend using regex: 对于文本解析，我建议使用正则表达式：

import re

# Read the File
with open("file.txt", "r") as file:
    text = file.read()

# replace newline with blank
text.replace('\n', '')

# parse the text
result = re.findall(r'(?P<before>\w+ ){30}target(P?<after>\w+ ){30}', text)

From there, all 30 words before are in a group called 'before' and all 30 words after are in a group called 'after' the target word -- in this example 'target'. 从那里开始，之前的所有30个单词都在称为“之前”的组中，而之后的所有30个单词都在称为目标单词的“之后”的组中。 RegEx can be really specific or really generic, depending on the pattern used. RegEx可以是特定的，也可以是通用的，具体取决于所使用的模式。 For example, the code above only allows for one space after a word and no punctuation. 例如，上面的代码只允许在单词后留一个空格，而不能使用标点符号。 For a guide on python regex: https://docs.python.org/3/howto/regex.html 有关python regex的指南： https : //docs.python.org/3/howto/regex.html

在一个单词之前和之后输出字符，然后写入文件

问题描述

1 个解决方案

解决方案1
4 2018-10-04 19:42:54

在一个单词之前和之后输出字符，然后写入文件

问题描述

1 个解决方案

解决方案1 4 2018-10-04 19:42:54

解决方案1
4 2018-10-04 19:42:54