繁体   English   中英

如何将 function 传递给 python 中的 re.findall()

[英]how to pass a function to re.findall() in python

我有一个要在大文本字符串中搜索的单词列表。我定义了一个返回每个单独单词的 function,但我不知道如何将 function 传递给 re.findall()。 我想获得包含单词列表中任何单词的任何句子。 有人可以帮忙:)这是我得到的:

strings = ['some large text', 'some large text'...]
ad = []
words = ['ascertained','deep','detected','disclosed','disinterred','espied','explored','exposed','famous','happened upon','identified','invented','learned','observed','perceived','presented','revealed','searched out','shown','sighted','spotted','unveiled']
def word():
    for i in words:
        t = word[i]
    return t

for i in range(len(strings)):
    ad += re.findall(r"([^.]*?word()[^.]*\.)",strings[i])
sep = ''
adc = sep.join(ad)

你想要这样的东西:

strings = ['some large text', 'some large text', 'dont disclosed it']
words = ['ascertained','deep','detected','disclosed','disinterred','espied','explored','exposed','famous','happened upon','identified','invented','learned','observed','perceived','presented','revealed','searched out','shown','sighted','spotted','unveiled']

reg = re.compile("(?=(" + "|".join(map(re.escape, words)) + "))")
ad = [i for i in strings if len(re.findall(reg,i))]
ad = ', '.join(ad)

广告:

'dont disclosed it'

或者

strings = ['my name is D.', 'I am 18 years old', 'I love deep learning', 'detected something fishy'] 
ad = []
words = ['ascertained','deep','detected','disclosed','disinterred','espied','explored','exposed','famous','happened upon','identified','invented','learned','observed','perceived','presented','revealed','searched out','shown','sighted','spotted','unveiled']

for i in range(len(strings)):
    ad += [strings[i] for word in words if len(re.findall(f"([^.]*?{word}[^.]*)", strings[i]))]

Output:

广告:

['I love deep learning', 'detected something fishy']
for i in range(len(strings)):
    ad += [re.findall(f"([^.]*?{word}[^.]*\.)", strings[i]) for word in words]

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM