正則表達式提取HTML標簽子元素？

Question

我在HTML字符串中有以下代碼。

<h3 class="large lheight20 margintop10">
<a href="https://google.com" class="marginright5 link linkWithHash detailsLink">
<span>get the content</span>
</a>

</h3><h3 class="large lheight20 margintop10">
<a href="https://google.com" class="marginright5 link linkWithHash detailsLink">
<span>get the content</span>
</a>

</h3>

我想提取以下標簽：

    <a href="https://google.com" class="marginright5 link linkWithHash detailsLink">
    <span>get the content</span>
    </a>
<a href="https://google.com" class="marginright5 link linkWithHash detailsLink">
<span>get the content</span>
</a>

我寫了以下正則表達式：

<h3[^>]+?>(.*)<\/h3>

但是它返回錯誤的結果：

<a href="https://google.com" class="marginright5 link linkWithHash detailsLink">
<span>get the content</span>
</a>

</h3><h3 class="large lheight20 margintop10">
<a href="https://google.com" class="marginright5 link linkWithHash detailsLink">
<span>get the content</span>
</a>

請幫助我提取標簽。

Answer 1

使用此正則表達式：

<h3[^>]+?>([^$]+?)<\/h3>

這里的例子：

https://regex101.com/r/pQ5nE0/2

Answer 2

您可以嘗試：

 function getA(str) { var regex = /<a\\s+[\\s\\S]+?<\\/a>/g; while (found = regex.exec(str)) { document.write(found[0] + '<br>'); } } var str = '<h3 class="large lheight20 margintop10">\\n' + '<a href="https://google.com" class="marginright5 link linkWithHash detailsLink">\\n' + '<span>get the content</span>\\n' + '</a>\\n' + '\\n' + '</h3><h3 class="large lheight20 margintop10">\\n' + '<a href="https://google.com" class="marginright5 link linkWithHash detailsLink">\\n' + '<span>get the content</span>\\n' + '</a>\\n' + '\\n' + '</h3>'; getA(str);

正則表達式提取HTML標簽子元素？

問題描述

2 個解決方案

解決方案1
2 已采納 2016-04-25 17:45:47

解決方案2
2 2016-04-25 20:00:14

正則表達式提取HTML標簽子元素？

問題描述

2 個解決方案

解決方案1 2 已采納 2016-04-25 17:45:47

解決方案2 2 2016-04-25 20:00:14

解決方案1
2 已采納 2016-04-25 17:45:47

解決方案2
2 2016-04-25 20:00:14