Regex taking too long in python

Question

I have used regex101 to test my regex and it works fine. What I am trying to do is detect these patterns:

section 1.2 random 2
1.2 random 2
1.2. random 2
random 2
random 2.

But if the string is just "random", like this, it shouldn't match:

random

My regex is this.

  m = re.match(r"^(((section)\s*|(\d+\.)|\d+|(\d+\.\d+)|[a-zA-z\s]|[a-zA-z\.\s])+((\d+\.$)|\d+$|(\d+\.\d+$)))","random random random random random",flags = re.I)

If I give it a long string, it gets stuck. Any ideas?

Answer 1

The following simplified regular expression handles every case listed in the question; those cases are reproduced as test strings below.

import re

regex = r'(?:section)*\s*(?:[0-9.])*\s*random\s+(?!random)(?:[0-9.])*'

strings = [
   "random random random random random",
   "section 1.2 random 2",
   "1.2 random 2",
   "1.2. random 2",
   "random 2",
   "random 2.",
   "random",
]

for string in strings:
    # re.match anchors only at the start of the string
    m = re.match(regex, string, flags=re.I)
    if m:
        print("match on", string)
    else:
        print("non match on", string)

which gives an output of:

non match on random random random random random
match on section 1.2 random 2
match on 1.2 random 2
match on 1.2. random 2
match on random 2
match on random 2.
non match on random

See it in action at: https://eval.in/661183
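
The answer does not say why the original pattern hangs. A likely explanation is catastrophic backtracking: inside the repeated group, both [a-zA-z\s] and [a-zA-z\.\s] can consume the same letter or space, so when the string does not end in a number the engine may try on the order of 2^n ways to split n characters before giving up. The sketch below times the question's pattern on short inputs to make that growth visible. The names ORIGINAL, SINGLE_CLASS and time_match are illustrative only, and SINGLE_CLASS is one possible rewrite under the assumption that the body may be any mix of letters, digits, dots and whitespace; it is not the answer's regex.

```python
import re
import time

# The pattern from the question, unchanged.
ORIGINAL = re.compile(
    r"^(((section)\s*|(\d+\.)|\d+|(\d+\.\d+)|[a-zA-z\s]|[a-zA-z\.\s])+"
    r"((\d+\.$)|\d+$|(\d+\.\d+$)))",
    re.I,
)

# Hypothetical rewrite (an assumption, not from the answer): the body is one
# character class, so each character can be consumed in only one way.
# Note: [a-z] with re.I is used instead of [a-zA-z], because the range A-z
# also covers the punctuation between "Z" and "a" in ASCII.
SINGLE_CLASS = re.compile(r"^[a-z\d.\s]*?\d+(?:\.\d+)?\.?$", re.I)


def time_match(pattern, text):
    """Return (matched, seconds) for one anchored match attempt."""
    start = time.perf_counter()
    matched = pattern.match(text) is not None
    return matched, time.perf_counter() - start


if __name__ == "__main__":
    # Keep the inputs short: the original pattern slows down very quickly
    # on strings that do not end in a number.
    for n in (8, 12, 16):
        text = ("random " * 3)[:n]
        _, slow = time_match(ORIGINAL, text)
        _, fast = time_match(SINGLE_CLASS, text)
        print(f"len={n:2d}  original={slow:.6f}s  single class={fast:.6f}s")
```

If the 2^n estimate holds, each extra character roughly doubles the time for the original pattern, which would explain why a 34-character string such as "random random random random random" appears to hang. Removing the overlap between alternatives, as both the answer's regex and this sketch do, takes away the ambiguity that drives the backtracking.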

1 answer

solution1
2 ACCEPTED 2016-10-15 23:29:08
