正则表达式选择HTML标记内的特定字符

Question

I'm only looking for standard tags like p, title, h1, h2 etc. 我只是在寻找像p，title，h1，h2等标准标签。

<[/a]*>content resides in here</[/a]*>

And I'm specifically looking for punctuation marks to combat a potential SQL injection. 我特意寻找标点符号来对抗潜在的SQL注入。 Also, for this project I am unable to use BeautifulSoup. 此外，对于这个项目，我无法使用BeautifulSoup。

Answer 1

Try this regex: 试试这个正则表达式：

<(a|h1|p|title)[^>]*>([^<]+)</\1[^>]*>

Discussion 讨论

正则表达式可视化

Demo 演示

http://regex101.com/r/mB4bQ1 http://regex101.com/r/mB4bQ1

Discussion 讨论

I assume that tags will contain text only, no tags... 我假设标签只包含文本，没有标签......
Python doesn't support recursive regular expression. Python不支持递归正则表达式。

正则表达式选择HTML标记内的特定字符

问题描述

1 个解决方案

解决方案1
0 2014-01-23 22:49:56

Discussion 讨论

Demo 演示

Discussion 讨论

正则表达式选择HTML标记内的特定字符

问题描述

1 个解决方案

解决方案1 0 2014-01-23 22:49:56

Discussion 讨论

Demo 演示

Discussion 讨论

解决方案1
0 2014-01-23 22:49:56