Using a regular expression to search for multiple keywords in cells
The regular expression:
\b(phrase)\b\W+(?:\w+\W+){0,6}?\b(one|two|three|four)\b
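\b(phrase)\b matches phrase at word boundaries.
\W+ matches one or more non-word characters (typically whitespace).
(?:\w+\W+){0,6}? matches, between 0 and 6 times and as few times as possible, one or more word characters followed by one or more non-word characters.
\b(one|two|three|four)\b matches one, two, three, or four at word boundaries.
The code: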
import re

text = "This sentence has phrase one and phrase word word two and phrase word three and phrase four phrase too many words too many words too many words four again."
# Group 1 is the anchor word and group 2 is the keyword; join them for each match.
matches = [m[1] + ' ' + m[2] for m in re.finditer(r'\b(phrase)\b\W+(?:\w+\W+){0,6}?\b(one|two|three|four)\b', text)]
print(matches)
Prints:
['phrase one', 'phrase two', 'phrase three', 'phrase four']
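The question is about searching cells, so here is a minimal sketch of how the same pattern might be applied to several cell values at once. It assumes the cells are available as plain Python strings in a list. The helper names build_pattern and find_pairs, the max_gap parameter, and the sample cells are hypothetical and only for illustration.

```python
import re

def build_pattern(anchor, keywords, max_gap=6):
    """Compile a pattern: anchor, then at most max_gap words, then a keyword."""
    # re.escape keeps literal text from being interpreted as regex syntax.
    alternatives = "|".join(re.escape(k) for k in keywords)
    return re.compile(
        r"\b(" + re.escape(anchor) + r")\b\W+"
        r"(?:\w+\W+){0," + str(max_gap) + r"}?"
        r"\b(" + alternatives + r")\b"
    )

def find_pairs(cell, pattern):
    """Return 'anchor keyword' strings for every match in one cell."""
    return [m[1] + " " + m[2] for m in pattern.finditer(cell)]

# Hypothetical cell contents, e.g. one column read from a spreadsheet.
cells = [
    "phrase one and phrase word word two",
    "phrase a b c d e f g four",  # seven words in between: too far apart
    "no keywords here",
]

pattern = build_pattern("phrase", ["one", "two", "three", "four"])
results = [find_pairs(cell, pattern) for cell in cells]
print(results)
```

Compiling the pattern once and reusing it for every cell avoids rebuilding the regex string per cell. A cell with no match yields an empty list, so the results stay aligned with the input cells.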