使用正则表达式查找包含重复单词的句子

Question

I've tried below.我在下面试过。

import re
sentences = "Glitches happened happened happened. Things go back to normal again."
print([re.findall(r"(\w+)\s\1", s) for s in sentences.split('.')])

I am wondering how to print the entire sentence(s) that contain duplicated words.我想知道如何打印包含重复单词的整个句子。

Answer 1

Here is one option, using a list comprehension along with re.search :这是一种选择，使用列表理解和re.search ：

inp = "Glitches happened happened happened. Things go back to normal again."
sentences = re.split(r'(?<=\.)\s+', inp)
duplicates = [s for s in sentences if re.search(r'\b(\S+)\b(?=.*\b\1\b)', s)]
print(duplicates)

This prints:这打印：

['Glitches happened happened happened.']

Answer 2

Could you please try following, using re.search function of Python's re library.您能否尝试以下操作，使用 Python 的re库的re.search功能。

>>> import re
>>> sentences = "Glitches happened happened happened. Things go back to normal again."
>>> print ( [el for el in sentences.split('. ') if re.search(r'\b(\w+)\s+\1\b', el)] )
    ['Glitches happened happened happened']

Answer 3

As workaround, you can try:作为解决方法，您可以尝试：

import re

sentences = "Glitches happened happened happened. Things go back to normal again. And once again again again."
print([s for s in sentences.split('.') if re.search(r"\b(\w+)\s+\1\b", s)])

result:结果：

['Glitches happened happened happened', ' And once again again again']

使用正则表达式查找包含重复单词的句子

问题描述

3 个解决方案

解决方案1
2 2020-11-17 06:56:54

解决方案2
2 2020-11-17 07:00:56

解决方案3
2 2020-11-17 07:02:19

使用正则表达式查找包含重复单词的句子

问题描述

3 个解决方案

解决方案1 2 2020-11-17 06:56:54

解决方案2 2 2020-11-17 07:00:56

解决方案3 2 2020-11-17 07:02:19

解决方案1
2 2020-11-17 06:56:54

解决方案2
2 2020-11-17 07:00:56

解决方案3
2 2020-11-17 07:02:19