提高Perl Regex中最后一次匹配的性能

Question

I need to find the last occurrence of matches based on an array of acceptable of value. 我需要根据可接受的值数组找到匹配的最后一次出现。 Below is the source codes in Perl. 以下是Perl中的源代码。 The answer is Q because it is the last occurrence based on acceptable values of A, Q, I & J. 答案是Q，因为它是基于A，Q，I和J的可接受值的最后一次出现。

The challenge is how can I change my codes to make the regex faster. 挑战在于如何更改代码以使正则表达式更快。 It is currently a bottleneck because I have to run it millions times. 当前这是一个瓶颈，因为我必须运行数百万次。

my $input = "A B C D E F G H I J K L M N O P Q R S T U V W X Y Z";
my $regex = qr/(A|Q|I|J)/;

my @matches = $input =~ m/\b$regex\b/g;

print $matches[$#matches];

I would like to see new codes that improves the query speed but still can find the Q match. 我希望看到可以提高查询速度的新代码，但仍然可以找到Q匹配项。

Answer 1

You can find the last match by simply adding a .* before the matching pattern. 您只需在匹配模式之前添加.*即可找到最后一个匹配项。

Like this 像这样

my $input = "APPLE B C D E F G H INDIGO JACKAL K L M N O P QUIVER R S T U V W X Y Z";
my $regex = qr/APPLE|QUIVER|INDIGO|JACKAL/;
my ($last) = $input =~ /.*\b($regex)\b/;
print $last, "\n";

output 输出

QUIVER

Answer 2

Use \\K to discard the previously matched characters from printing at the final. 使用\\K放弃最后匹配的先前字符。

my $input = "A B C D E F G H I J K L M N O P Q R S T U V W X Y Z";
my $regex = qr/.*\K\b[AQIJ]\b/;
if ($input =~ m/$regex/) {
print $&."\n";
}

Use capturing group. 使用捕获组。

my $input = "A B C D E F G H I J K L M N O P Q R S T U V W X Y Z";
my $regex = qr/.*\b([AQIJ])\b/;
if ($input =~ m/$regex/) {
print $1."\n";
}

Update: 更新：

my $input = "Apple Orange Mango Apple";
my $regex = qr/.*\K\b(?:Apple|Range|Mango)\b/;
if ($input =~ m/$regex/) {
print $&."\n";
}

提高Perl Regex中最后一次匹配的性能

问题描述

2 个解决方案

解决方案1
3 2015-04-19 00:36:23

解决方案2
-1 已采纳 2015-04-19 00:24:56

提高Perl Regex中最后一次匹配的性能

问题描述

2 个解决方案

解决方案1 3 2015-04-19 00:36:23

解决方案2 -1 已采纳 2015-04-19 00:24:56

解决方案1
3 2015-04-19 00:36:23

解决方案2
-1 已采纳 2015-04-19 00:24:56