无法解析正则表达式

Question

I've some data and I want to extract some details from it. 我有一些数据，我想从中提取一些细节。

<div id="ctl00_hpVendorManager">
<h5 class="panelTitle"><span class="title ">Vendor Manager(s)</span></h5>
Resource Manager: CHS MSP HOUSE<br>
Resource Administrator:     
</div>

I want to extract data between </h5> and </div> . 我想提取</h5>和</div>之间的数据。

Here is the regular expression that I've tried. 这是我尝试过的正则表达式。

>Vendor Manager\(s\).*?<\/h5>(.*?)<\/

but it doesn't seems working. 但它似乎不起作用。

any clue where I'm doing wrong. 任何我做错地方的线索。

Answer 1

First of all you shouldn't use regular expression for such tasks. 首先，您不应该对此类任务使用正则表达式。 Parse the HTML and use something like XPath to extract a portion of it. 解析HTML并使用XPath之类的东西提取其中的一部分。

In case you still want to do it, try a pattern like this: 如果您仍然想要这样做，请尝试以下模式：

<\\/h5>(?s)(.*)<\\/div>

Answer 2

try this: 尝试这个：

<\\/h5>(.|\\n)*?<\\/div>

demo 演示

无法解析正则表达式

问题描述

2 个解决方案

解决方案1
3 2017-01-07 23:35:34

解决方案2
2 已采纳 2017-01-07 23:29:32

无法解析正则表达式

问题描述

2 个解决方案

解决方案1 3 2017-01-07 23:35:34

解决方案2 2 已采纳 2017-01-07 23:29:32

解决方案1
3 2017-01-07 23:35:34

解决方案2
2 已采纳 2017-01-07 23:29:32