简体繁体中英

Improve regular expression to end on quotation

原文 2014-10-28 21:20:20 1 2 python/ regex

I have the following regular expression:

>>> re.findall('http://www.rottentomatoes.com/.+', html)
['http://www.rottentomatoes.com/m/1129132-torque" class="see-all">Read More About This Movie On Rotten Tomatoes</a>']

How would I get this to match up until the " . I am trying to get the return to be:

http://www.rottentomatoes.com/m/1129132-torque

2 answers

Use a non-greedy quantifier ? to stop at the first " :

>>> html = 'http://www.rottentomatoes.com/m/1129132-torque" class="see-all">Read More About This Movie On Rotten Tomatoes</a>'
>>> re.search('(http://www\.rottentomatoes\.com/.+?)"', html).group(1)
'http://www.rottentomatoes.com/m/1129132-torque'

Just add the character(") where you want to stop. Also add ? , so that it stops at the first match.

>>> html='http://www.rottentomatoes.com/m/1129132-torque" class="see-all">Read More About This Movie On Rotten Tomatoes</a>'
>>> re.findall('http://www.rottentomatoes.com/.+?\"', html)
['http://www.rottentomatoes.com/m/1129132-torque"']

How to improve that regular expression in python?

How to improve the performance of this regular expression?

unexpected end of regular expression

Regular expression start and end with

How to improve this regular expression to work in other situations?

How to improve regular expression to extract phone numbers?

Define end and beginning of regular expression

Python Regular expression to end of line

python regular expression with “OR” and $ end of line

Regular expression exclude matches surrounded by quotation marks and lines starting with %

暂无

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question How to improve that regular expression in python? How to improve the performance of this regular expression? unexpected end of regular expression Regular expression start and end with How to improve this regular expression to work in other situations? How to improve regular expression to extract phone numbers? Define end and beginning of regular expression Python Regular expression to end of line python regular expression with “OR” and $ end of line Regular expression exclude matches surrounded by quotation marks and lines starting with %

Related Tags

粤ICP备18138465号 © 2020-2024 STACKOOM.COM