使用正则表达式提取HTML页面中的href ID

Question

I was trying to extract ID which is in HTML page within href. 我试图提取href内HTML页面中的ID。 Html looks like below HTML看起来像下面

<p>To register your account, please click the following link:</p>
<p><a href="https://abc-api-test.mywebsites.net:443/#/userreg/99978f1c-4c04-41ac-abcb-5039658a1f52" target="_blank">Complete registration.</a></p>
<p>If you have any questions please do not hesitate to contact us at <a href="mailto:muaccount@aol.net">

Basically I want to extract 99978f1c-4c04-41ac-abcb-5039658a1f52 value from the above. 基本上我想从上面提取99978f1c-4c04-41ac-abcb-5039658a1f52值。

Thanks 谢谢

Answer 1

Please try this 请尝试这个

// specify Regular expression
Regex pageParser = new Regex(@"href=[""|']https://abc-api-test.mywebsites.net:443/#/userreg/(?<ID>[\S]*?)[""|']", RegexOptions.IgnoreCase | RegexOptions.Multiline);

// extract matches from your HTML
MatchCollection matches = pageParser.Matches(yourHtml);

//Iterate through each match
foreach (var m in matches)
{
      var id = m.Groups["ID"].Value;

      // do whatever you want with the ID
}

使用正则表达式提取HTML页面中的href ID

问题描述

1 个解决方案

解决方案1
2 已采纳 2016-02-13 13:29:34

使用正则表达式提取HTML页面中的href ID

问题描述

1 个解决方案

解决方案1 2 已采纳 2016-02-13 13:29:34

解决方案1
2 已采纳 2016-02-13 13:29:34