Regex doesnt work with re.findall

Question

I have a unicode string of the format

I have this unicode string :

unistr= [something::a.b.c][someotherthing::e.f.g]

I tried to write a regex that takes in only the strings before and after the "::" delimiter. I tried testing this regex: ([\\w\\.]).+?(?=\\:\\:) with my string in an online regex builder and it gave me out the desired result.

However when I wrapped it within this re.findall function, it doesn't give me the same result. it gives out [c,g] This is what I tried:

re.findall(r'([\w\.]).+?(?=\:\:)',unistr) #to get the string before "::"
re.findall(r'.+?([\w\.]\:\:)',unistr) # to get after "::"

What am I doing wrong?

Answer 1

I think you tested it wrong somehow. I modified it with this expression: ([\\w\\.])+ instead on Pythex and it captured two groups, someotherstring and efg , which is what I think you want, right?

Answer 2

I think you need to use finditer with ([^\\[]*)\\:{2}([^\\]]*) regex to get the :: -delimited contents inside the square brackets:

import re
unistr = u'unistr= [something::a.b.c]'
print [[x.group(1), x.group(2)] for x in re.finditer(ur'([^\[]*)\:{2}([^\]]*)',unistr)]

Output of a sample program :

[[u'something', u'a.b.c']]

Answer 3

You can use the following :

import re
unistr= 'something::a.b.c'
print re.findall(r'^.+?(?=::)',unistr)
print re.findall(r'(?<=::).+?$',unistr)

Output:

['something']                                                                
['a.b.c']

Answer 4

Use this:

unistr= '[something::a.b.c][someotherthing::e.f.g]'
map(lambda v: v.split('::'), re.findall(r'\w+\:\:[\w\.]+', unistr))

Output:

Out[412]:
[['something', 'a.b.c'], ['someotherthing', 'e.f.g']]

Answer 5

I wouldn't complicate things, this will work:

re.findall(r'(\w+)::', unistr)

It matches word characters followed by :: and captures it, returns a list containing all matches.

Note that : is not a special character, shouldn't be escaped.

Regex doesnt work with re.findall

Question

5 answers

solution1
1 2015-04-27 08:59:14

solution2
1 2015-04-27 09:01:13

solution3
1 2015-04-27 09:08:28

solution4
1 2015-04-27 09:37:52

solution5
1 ACCPTED 2015-04-27 12:39:34

Regex doesnt work with re.findall

Question

5 answers

solution1 1 2015-04-27 08:59:14

solution2 1 2015-04-27 09:01:13

solution3 1 2015-04-27 09:08:28

solution4 1 2015-04-27 09:37:52

solution5 1 ACCPTED 2015-04-27 12:39:34

solution1
1 2015-04-27 08:59:14

solution2
1 2015-04-27 09:01:13

solution3
1 2015-04-27 09:08:28

solution4
1 2015-04-27 09:37:52

solution5
1 ACCPTED 2015-04-27 12:39:34