简体   繁体   中英

How to extract the contents of square brackets in a string of text in c# using Regex

if i have a string of text like below, how can i collect the contents of the brackets in a collection in c# even if it goes over line breaks?

eg...

string s = "test [4df] test [5yu] test [6nf]";

should give me..

collection[0] = 4df

collection[1] = 5yu

collection[2] = 6nf

You can do this with regular expressions, and a bit of Linq.

    string s = "test [4df] test [5y" + Environment.NewLine + "u] test [6nf]";

    ICollection<string> matches =
        Regex.Matches(s.Replace(Environment.NewLine, ""), @"\[([^]]*)\]")
            .Cast<Match>()
            .Select(x => x.Groups[1].Value)
            .ToList();

    foreach (string match in matches)
        Console.WriteLine(match);

Output:

4df
5yu
6nf

Here's what the regular expression means:

\[   : Match a literal [
(    : Start a new group, match.Groups[1]
[^]] : Match any character except ]
*    : 0 or more of the above
)    : Close the group
\]   : Literal ]
Regex regex = new Regex(@"\[[^\]]+\]", RegexOptions.Multiline);

关键是要正确地转义正则表达式中使用的特殊字符,例如你可以匹配[字符这样: @"\\["

Regex rx = new Regex(@"\[.+?\]");
var collection = rx.Matches(s);

You will need to trim the square brackets off, the important part is the lazy operator.

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM