在 Python 中使用正则表达式查找子字符串？

Question

I have A string:我有一个字符串：

line_to_test = "http://website/[SequenceOfLetters&NumbersONLY].html"

I want a regex for matching the above pattern:我想要一个正则表达式来匹配上述模式：

what i have tried currently is:我目前尝试过的是：

c = len(re.findall(r"http:\/\/website\/([a-zA-Z0-9]?).html",line_to_test))

But c here comes to be null even when the line_to_test contains the pattern.但是即使line_to_test包含模式，这里的c也会为空。

Answer 1

? means whatever preceded it was optional, in this case [a-zA-Z0-9] .意味着它之前的任何内容都是可选的，在这种情况下是[a-zA-Z0-9] 。 That means you can have a letter or number either 0 or 1 times.这意味着您可以有0次或1次的字母或数字。

You should use the * , to select it 0 times or more, or use the + , to select it 1 times` or more.您应该使用* , 选择它0次或更多次，或使用+ , 选择它1次或更多。

Try this RegEx:试试这个正则表达式：

c = len(re.findall(r"http:\/\/website\/([a-zA-Z0-9]+).html",line_to_test))

If you used a * , it would be the same as ([a-zA-Z0-9]+)?如果您使用* ，它将与([a-zA-Z0-9]+)? , meaning http://website/.html would work. ，这意味着http://website/.html可以工作。

Live Demo on RegExr RegExr 上的现场演示

Answer 2

? will only match 0 or 1 character.只会匹配 0 或 1 个字符。 Try尝试

c = len(re.findall(r"http:\/\/website\/([a-zA-Z0-9]+).html",line_to_test))

You can use an online service like regexr to test your regexes: http://regexr.com/3d301您可以使用像 regexr 这样的在线服务来测试您的正则表达式： http ://regexr.com/3d301

在 Python 中使用正则表达式查找子字符串？

问题描述

2 个解决方案

解决方案1
2 2016-03-24 10:36:07

解决方案2
0 已采纳 2016-03-24 10:28:00

在 Python 中使用正则表达式查找子字符串？

问题描述

2 个解决方案

解决方案1 2 2016-03-24 10:36:07

解决方案2 0 已采纳 2016-03-24 10:28:00

解决方案1
2 2016-03-24 10:36:07

解决方案2
0 已采纳 2016-03-24 10:28:00