match part of a string until it reaches the end of the line (python regex)

Question

If I have a large string with multiple lines and I want to match part of a line only to end of that line, what is the best way to do that?

So, for example I have something like this and I want it to stop matching when it reaches the new line character.

r"(?P<name>[A-Za-z\s.]+)"

I saw this in a previous answer :

$ - indicates matching to the end of the string, or end of a line if multiline is enabled.

My question is then how do you "enable multiline" as the author of that answer states?

Answer 1

Simply use

r"(?P<name>[A-Za-z\t .]+)"

This will match ASCII letters, spaces, tabs or periods. It'll stop at the first character that's not included in the group - and newlines aren't (whereas they are included in \\s , and because of that it's irrelevant whether multiline mode is turned on or off).

Answer 2

You can enable multiline matching by passing re.MULTILINE as the second argument to re.compile() . However, there is a subtlety to watch out for: since the + quantifier is greedy, this regular expression will match as long a string as possible, so if the next line is made up of letters and whitespace, the regex might match more than one line ( $ matches the end of any string).

There are three solutions to this:

Change your regex so that, instead of matching any whitespace including newline ( \\s ) your repeated character set does not match that newline.
Change the quantifier to +? , the non-greedy ("minimal") version of + , so that it will match as short a string as possible and therefore stop at the first newline.
Change your code to first split the text up into an individual string for each line (using text.split('\\n') .

Answer 3

请查看http://docs.python.org/library/re.html#module-contents上的flags参数

match part of a string until it reaches the end of the line (python regex)

Question

3 answers

solution1
8 ACCPTED 2011-09-09 20:06:29

solution2
2 2011-09-09 19:47:53

solution3
1 2011-09-09 19:36:36

match part of a string until it reaches the end of the line (python regex)

Question

3 answers

solution1 8 ACCPTED 2011-09-09 20:06:29

solution2 2 2011-09-09 19:47:53

solution3 1 2011-09-09 19:36:36

solution1
8 ACCPTED 2011-09-09 20:06:29

solution2
2 2011-09-09 19:47:53

solution3
1 2011-09-09 19:36:36