int doesn't work in def within regular expression in python

Question

I need to write function which takes a count and a string and return a list of all of the words in the string that are count word characters long or longer.

My function is:

import re

def find_words(count, a_str):
    count = int(count)
    return re.findall(r'\w{},'.format(int(count)), a_str)

But it doesn't work, it is return empty list:

Example:

find_words(4, "dog, cat, baby, balloon, me")

Should return:

['baby', 'balloon']

Answer 1

The regex isn't correct. The {} is interpreted as placeholder for format , but you want it to be the regexs' {} which specifies the number of repeats. You need to use r'\\w{{{}}}' here. Observe the difference:

>>> r'\w{},'.format(4)
'\\w4,'

>>> r'\w{{{},}}'.format(4)
'\\w{4,}'

And then it works correctly:

import re
def find_words(count, a_str):
    count = int(count)
    return re.findall(r'\w{{{},}}'.format(count), a_str)

>>> find_words(4, "dog, cat, baby, balloon, me") 
['baby', 'balloon']

Answer 2

Why RegExp?

>>> string = "dog, cat, baby, balloon, me"
>>> [word for word in string.split(', ') if len(word) >= 4]
['baby', 'balloon']

So function could be something like follow:

>>> def find_words(count, a_str):
...     return [word for word in a_str.split(', ') if len(word) >= count]
...
>>> find_words(4, 'dog, cat, baby, balloon, me')
['baby', 'balloon']

Answer 3

You can try this:

def find_words(count, a_str):
   s = [re.findall("\w{"+str(count)+",}", i) for i in ["dog, cat, baby, balloon, me"]]
   return s[0]

print(find_words(4, ["dog, cat, baby, balloon, me"]))

Output:

['baby', 'balloon']

int doesn't work in def within regular expression in python

Question

3 answers

solution1
3 2017-08-26 12:40:59

solution2
2 2017-08-26 12:51:35

solution3
0 2017-08-26 17:06:11

int doesn't work in def within regular expression in python

Question

3 answers

solution1 3 2017-08-26 12:40:59

solution2 2 2017-08-26 12:51:35

solution3 0 2017-08-26 17:06:11

solution1
3 2017-08-26 12:40:59

solution2
2 2017-08-26 12:51:35

solution3
0 2017-08-26 17:06:11