从字符串中提取字符（正则表达式）

Question

我想从下面的示例字符串中提取粗体字符。 模式如下：

ChunkOfAlphabets_ChunkOfDigits_ CharIWant _ChunkOfDigits_CharIDontCare

“ ABC12 A 1234D”

“ ABCD34 B 5678E”

“ EF34 C 9101F”

我想出了以下代码。 似乎工作正常，但我想知道是否有更有效的方法，也许使用正则表达式？

    char extractString(string test)
    {
        bool isDigit = false;
        foreach(var c in test)
        {
            if (isDigit && !char.IsDigit(c))
                return c;

            isDigit = char.IsDigit(c);
        }

        return '0';
    }

Answer 1

如果您使用的是C＃LINQ，将会更轻松，性能更高（正则表达式会涉及很多开销）：

static char ExtractString(string test)
{
    return test.SkipWhile(c => Char.IsLetter(c))
               .SkipWhile(c => Char.IsDigit(c))
               .FirstOrDefault();

}

Answer 2

首先，一个正则表达式不应该比一个好的算法少的快。 但是，我给你一个正则表达式来尝试一下，并检查什么更快。

以下正则表达式为我提供了您想要的：

^\D+\d+([A-Za-z])\d+\D+$

我建议您使用https://regex101.com/ ，它非常适合测试类似的东西。

Answer 3

使用正则表达式，C＃中的此函数应该可以实现您期望的功能，但是我怀疑它比简单的算法更有效：

    using System.Text.RegularExpressions;

    private char extractChar(string test)
    {
        char charOut = '\0';
        var matches = Regex.Matches(test, "^[a-zA-Z]+[0-9]+([a-zA-Z])[0-9]+.+");
        if (matches.Count > 0)
            charOut = matches[0].Groups[1].Value[0];

        return charOut;
    }

Answer 4

假设

ChunkofAlphabets = [A-Za-z] <-英文字母

ChunkOfDigits = [0-9]

CharIWant =可以是除数字[0-9]之外的任何字符

假设以上，正则表达式应为

^[A-Za-z]+\d+(\D+)\d+.*$

正则表达式演示

C＃代码Ideone演示

从字符串中提取字符（正则表达式）

问题描述

4 个解决方案

解决方案1
4 已采纳 2016-04-18 06:20:27

解决方案2
3 2016-04-18 05:20:02

解决方案3
1 2016-04-18 05:47:32

解决方案4
1 2016-04-18 05:51:04

从字符串中提取字符（正则表达式）

问题描述

4 个解决方案

解决方案1 4 已采纳 2016-04-18 06:20:27

解决方案2 3 2016-04-18 05:20:02

解决方案3 1 2016-04-18 05:47:32

解决方案4 1 2016-04-18 05:51:04

解决方案1
4 已采纳 2016-04-18 06:20:27

解决方案2
3 2016-04-18 05:20:02

解决方案3
1 2016-04-18 05:47:32

解决方案4
1 2016-04-18 05:51:04