extract words with specific character sequence

Question

I have a list of strings. I only want to extract the words within each string that have a specific character sequence.

For example

l1=["grad madd have", "ddim middle left"]

I want all the words that have sequence "dd"

so I would like to get

[["madd"], ["ddim", "middle"]]

I've been trying patterns of the form

[re.findall(r'(\b.*?dd.*\s+)',word) for word in l1]

but have had little success

Answer 1

You can just use list comprehension for this. You don't need regex to accomplish what you're trying to do.

See code in use here

l1=["grad madd have", "ddim middle left"]
print([s for a in l1 for s in a.split() if 'dd' in s])

This loops over l1 and splits each value by the space character. It then tests that substring to see if it contains dd and returns it if it does.

Answer 2

您接近了，您想要使用\\w*将单词字符0匹配很多次：

[re.findall(r'\w*dd\w*', word) for word in l1]

Answer 3

You can try with this Regex : \\b\\w*dd\\w*\\b

Regex101 Demo.

Answer 4

Try this in one line:

l1=["grad madd have", "ddim middle left"]

print(list(map(lambda x:list(filter(lambda y:'dd' in y,x.split())),l1)))

output:

[['madd'], ['ddim', 'middle']]

extract words with specific character sequence

Question

4 answers

solution1
1 ACCPTED 2018-01-23 14:57:54

solution2
1 2018-01-23 15:00:26

solution3
0 2018-01-23 14:53:09

solution4
0

extract words with specific character sequence

Question

4 answers

solution1 1 ACCPTED 2018-01-23 14:57:54

solution2 1 2018-01-23 15:00:26

solution3 0 2018-01-23 14:53:09

solution4 0

solution1
1 ACCPTED 2018-01-23 14:57:54

solution2
1 2018-01-23 15:00:26

solution3
0 2018-01-23 14:53:09

solution4
0