删除HTML标记，但不使用正则表达式C＃

Question

I wanted to strip all the html but preserve  tags using regex. 我想剥离所有html，但使用正则表达式保留标记。 Is there a better way to do instead of 有没有更好的方法可以代替

Replace  with a non html tag like $b$ 将替换为非html标签，例如$ b $
Remove all html tags using <[^>]*> 使用<[^>]*>删除所有html标记
Replace $b$ with  将$ b $替换为

Answer 1

Below is one approach that will only permit opening and closing b tags. 以下是一种仅允许打开和关闭b标签的方法。 Any other tags are removed. 任何其他标签都将被删除。

var teststring = "Test <b>test</b> lorem <i>ipsum</i>";
var pattern = @"(?!</?b>)<.*?>"; // assuming open and closing tags are retained
Console.WriteLine(Regex.Replace
       (teststring,
         pattern,
         String.Empty,
         RegexOptions.Multiline));

Outputs: Test test lorem ipsum 输出： Test test lorem ipsum

删除HTML标记，但不使用正则表达式C＃

问题描述

1 个解决方案

解决方案1
5 已采纳 2013-05-01 02:08:02

删除HTML标记，但不使用正则表达式C＃

问题描述

1 个解决方案

解决方案1 5 已采纳 2013-05-01 02:08:02

解决方案1
5 已采纳 2013-05-01 02:08:02