python regex - matching tail of substring match

Question

I've got an address string of type

'Suite 100 <building name>, <street number, name + rest of address>'

and I'm trying to extract the suite part and the rest of the address line after the suite part, using regex, but it's not working as expected. Here's what I'm using:

>> res = re.match(r'Suite \d+ (\S+)?', 'Suite 250 Victory Plaza, 100 Sunshine Street, Paradise City 99999')
>> res.groups()
>> ('Victory',)

I want the result to have two groups, the first containing 'Suite 250' and the second to have the rest of the string. How can I do this?

Answer 1

Try the following:

r"(Suite \d+)\s*(.+)"

The parts in parentheses are the groups to be captured. '.' matches any character (except new lines, unless you use the DOTALL flag.)

Two things are wrong with your pattern. 1) You are not capturing the "Suite \\d+" part as it is not surrounded by parentheses. 2) "\\S" matches any character except white space, this is why you only capture the first word.

python regex - matching tail of substring match

Question

1 answers

solution1
3 ACCPTED 2016-12-09 14:00:57

python regex - matching tail of substring match

Question

1 answers

solution1 3 ACCPTED 2016-12-09 14:00:57

solution1
3 ACCPTED 2016-12-09 14:00:57