正則表達式與C＃中的字符串不匹配

Question

我有一些HTML（我需要在一個大文檔中）將其解析為文本，而我感興趣的部分如下所示：

...
<div id="whatever" class="whatever whatever">some title with <em>html</em> and other such tags in it, but never a div tag</div>
...

現在，我想從中獲取帶有HTML的DIV中的文本。 這是我對正則表達式的使用（使用組）：

<div id=\"whatever\" class=\"whatever whatever\">(?<title>[^</div>]*?)</div>

因此，我的想法是將整個內容匹配起來，並得到一個包含所有文本的組，直到出現</ div>為止（因為該字符串的末尾沒有其他識別因素）。

[]中的^不起作用，因為它是這些字符的“任意”，而不是我想要的字符串“ </ div>”。 有什么想法可以使我工作嗎？

Answer 1

Match m=Regex.Match(s,"\\<div id=\"whatever\" class=\"whatever whatever\">(.*?)\\<\\/div\\>");                                                       
Console.WriteLine(m.Groups[1].Value);

正則表達式與C＃中的字符串不匹配

問題描述

1 個解決方案

解決方案1
0 2012-06-11 00:28:37

正則表達式與C＃中的字符串不匹配

問題描述

1 個解決方案

解決方案1 0 2012-06-11 00:28:37

解決方案1
0 2012-06-11 00:28:37