Regex to parse out html from CDATA with C#

Question

I would like to parse out any HTML data that is returned wrapped in CDATA.

As an example <![CDATA[<table><tr><td>Approved</td></tr></table>]]>

Thanks!

Answer 1

The expression to handle your example would be

\<\!\[CDATA\[(?<text>[^\]]*)\]\]\>

Where the group "text" will contain your HTML.

The C# code you need is:

using System.Text.RegularExpressions;
RegexOptions   options = RegexOptions.None;
Regex          regex = new Regex(@"\<\!\[CDATA\[(?<text>[^\]]*)\]\]\>", options);
string         input = @"<![CDATA[<table><tr><td>Approved</td></tr></table>]]>";

// Check for match
bool   isMatch = regex.IsMatch(input);
if( isMatch )
  Match   match = regex.Match(input);
  string   HTMLtext = match.Groups["text"].Value;
end if

The "input" variable is in there just to use the sample input you provided

Answer 2

I know this might seem incredibly simple, but have you tried string.Replace()?

string x = "<![CDATA[<table><tr><td>Approved</td></tr></table>]]>";
string y = x.Replace("<![CDATA[", string.Empty).Replace("]]>", string.Empty);

There are probably more efficient ways to handle this, but it might be that you want something that easy...

Answer 3

没有太多细节，但如果没有你没有描述的复杂性，一个非常简单的正则表达式应该匹配它：

/<!\[CDATA\[(.*?)\]\]>/

Answer 4

找到CDATA部分的正则表达式将是：

(?:<!\[CDATA\[)(.*?)(?:\]\]>)

Answer 5

Why do you want to use Regex for such a simple task? Try this one:

str = str.Trim().Substring(9);
str = str.Substring(0, str.Length-3);

Answer 6

Regex r = new Regex("(?<=<!\[CDATA\[).*?(?=\]\])");

Regex to parse out html from CDATA with C#

Question

6 answers

solution1
8 ACCPTED 2009-05-01 17:24:38

solution2
4 2009-05-01 17:21:03

solution3
2 2009-05-01 17:22:12

solution4
1 2009-05-01 17:23:28

solution5
0 2011-09-09 15:47:47

solution6
0 2009-05-01 17:25:05

Regex to parse out html from CDATA with C#

Question

6 answers

solution1 8 ACCPTED 2009-05-01 17:24:38

solution2 4 2009-05-01 17:21:03

solution3 2 2009-05-01 17:22:12

solution4 1 2009-05-01 17:23:28

solution5 0 2011-09-09 15:47:47

solution6 0 2009-05-01 17:25:05

solution1
8 ACCPTED 2009-05-01 17:24:38

solution2
4 2009-05-01 17:21:03

solution3
2 2009-05-01 17:22:12

solution4
1 2009-05-01 17:23:28

solution5
0 2011-09-09 15:47:47

solution6
0 2009-05-01 17:25:05