使用正则表达式提取值

Question

我想从这段文本（html标记）中提取值“ 64,111”。

     <tr>
     <th id="abc-xyz">Page <span class="sub">avg</span></th>
    <td headers="abc-xyz">
    10th Aug, 2011  </td>
  <td headers="abc-xyz">64,111</td>
     </tr>

我目前正在使用此正则表达式-：

Match m2 = Regex.Match(text, @"\<td headers=""abc-xyz""\>(.*?)\</td\>", RegexOptions.IgnoreCase);

但没有结果，请告诉我我做错了什么？

Answer 1

用\\转义双引号

Match m2 = Regex.Match(text, "(?<=<td\sheaders=\"abc-xyz\">).*(?=</td>)", 
                       RegexOptions.IgnoreCase);

Answer 2

代替 ”。” 使用除终止字符之外的字符类。 也就是说，您想要">([^<]*)<"而不是">(.*)<" ">([^<]*)<" 。

我假设您知道这不能替代真正的解析，而正则表达式则无法做到这一点，因此我不会对此进行宣传。 这个网站上已经有一个非常有趣的回应。

Answer 3

嗯，有多种方法可以给猫皮剥皮。
解析XML不仅限于正则表达式，因此这是使用Linq to XML的一种方法。

string found = (from td in XElement.Parse(myxml).Elements("td")
                where td.HasAttributes
                let headers = td.Attribute("headers")
                where headers != null && headers.Value == "abc-xyz" && !td.HasElements
                select td.Value).FirstOrDefault();

Linq to XML教程

使用正则表达式提取值

问题描述

3 个解决方案

解决方案1
0 已采纳 2012-10-15 00:25:37

解决方案2
0 2012-10-15 00:26:35

解决方案3
0 2012-10-15 01:32:15

使用正则表达式提取值

问题描述

3 个解决方案

解决方案1 0 已采纳 2012-10-15 00:25:37

解决方案2 0 2012-10-15 00:26:35

解决方案3 0 2012-10-15 01:32:15

解决方案1
0 已采纳 2012-10-15 00:25:37

解决方案2
0 2012-10-15 00:26:35

解决方案3
0 2012-10-15 01:32:15