正则表达式以相同的字符串开头和结尾，而不仅仅是相同的字符

Question

I want to create a regular expression to receive: 我想创建一个正则表达式来接收：

<p class="MyClass">
   <p> something 1 </p>
   <p> something 2 </p>
   <span>         <span>  // or more html tag here
   something
</p>
something's here, not in any tag!

from: 从：

<p class="MyClass">
   <p> something 1 </p>
   <p> something 2 </p>
   <span>         <span>  // or more html tag here
   something
</p>
something's here, not in any tag!

<p class="MyClass">
   <p> another thing 1</p>
   <p> another thing 2</p>
   <p> another thing 3</p>
   another thing
</p>
...

I think I will use a regex to match everything between  and the next one. 我想我将使用正则表达式来匹配和下一个之间的所有内容。 So the regex is /([\\s\\S]*)/ , work correctly in this case. 因此，正则表达式为/([\\s\\S]*)/ ，在这种情况下可以正常工作。 But it doesn't work when I want to get a notification of this page http://daotao.dut.udn.vn/sv/G_Thongbao_LopHP.aspx . 但是，当我想收到此页面的通知http://daotao.dut.udn.vn/sv/G_Thongbao_LopHP.aspx时，它不起作用。 The DOM is so strange ?! DOM是如此奇怪？

Sorry for my bad English. 对不起，我的英语不好。

Answer 1

regex should be 正则表达式应该是

(<p class="MyClass">[\s\S]*?)(?=<p class="MyClass">|$)

[\\s\\S]*? : *? ： *? is a lazy quantifier so that it matches the shortest the default is greedy (matches the largest). 是一个懒惰的量词，因此它匹配最短的默认值是贪婪（匹配最大的）。
(?=|$) : lookhead so that it does not belongs to the match, and |$ to get also the last match (?=|$) ：lookhead，因此它不属于匹配项，而|$也可以得到最后一个匹配项

正则表达式以相同的字符串开头和结尾，而不仅仅是相同的字符

问题描述

1 个解决方案

解决方案1
1 已采纳 2018-01-15 12:18:16

正则表达式以相同的字符串开头和结尾，而不仅仅是相同的字符

问题描述

1 个解决方案

解决方案1 1 已采纳 2018-01-15 12:18:16

解决方案1
1 已采纳 2018-01-15 12:18:16