Boyer-Moore-Horspool Algorithm for All Matches (Find Byte array inside Byte array)
Here is my implementation of the BMH algorithm (it works like a charm):
    public static Int64 IndexOf(this Byte[] value, Byte[] pattern)
    {
        if (value == null)
            throw new ArgumentNullException("value");
        if (pattern == null)
            throw new ArgumentNullException("pattern");
        Int64 valueLength = value.LongLength;
        Int64 patternLength = pattern.LongLength;
        if ((valueLength == 0) || (patternLength == 0) || (patternLength > valueLength))
            return -1;
        Int64[] badCharacters = new Int64[256];
        for (Int64 i = 0; i < 256; ++i)
            badCharacters[i] = patternLength;
        Int64 lastPatternByte = patternLength - 1;
        for (Int64 i = 0; i < lastPatternByte; ++i)
            badCharacters[pattern[i]] = lastPatternByte - i;
        // Beginning
        Int64 index = 0;
        while (index <= (valueLength - patternLength))
        {
            for (Int64 i = lastPatternByte; value[(index + i)] == pattern[i]; --i)
            {
                if (i == 0)
                    return index;
            }
            index += badCharacters[value[(index + lastPatternByte)]];
        }
        return -1;
    }
I tried to modify it to return all the matches instead of only the first index, but I'm getting an IndexOutOfRangeException everywhere.
Obviously I'm missing something important, or I didn't properly understand how it works. What am I doing wrong?
    public static List<Int64> IndexesOf(this Byte[] value, Byte[] pattern)
    {
        if (value == null)
            throw new ArgumentNullException("value");
        if (pattern == null)
            throw new ArgumentNullException("pattern");
        Int64 valueLength = value.LongLength;
        Int64 patternLength = pattern.LongLength;
        if ((valueLength == 0) || (patternLength == 0) || (patternLength > valueLength))
            return (new List<Int64>());
        Int64[] badCharacters = new Int64[256];
        for (Int64 i = 0; i < 256; ++i)
            badCharacters[i] = patternLength;
        Int64 lastPatternByte = patternLength - 1;
        for (Int64 i = 0; i < lastPatternByte; ++i)
            badCharacters[pattern[i]] = lastPatternByte - i;
        // Beginning
        Int64 index = 0;
        List<Int64> indexes = new List<Int64>();
        while (index <= (valueLength - patternLength))
        {
            for (Int64 i = lastPatternByte; value[(index + i)] == pattern[i]; --i)
            {
                if (i == 0)
                    indexes.Add(index);
            }
            index += badCharacters[value[(index + lastPatternByte)]];
        }
        return indexes;
    }
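Change

    if (i == 0)
        indexes.Add(index);

to

    if (i == 0)
    {
        indexes.Add(index);
        break;
    }

In the original IndexOf, the return statement left the inner loop as soon as a match was found. Without the break, the for loop decrements i to -1 after recording the match and then evaluates its condition again, reading pattern[-1] (or value[-1] when index is 0), which throws the IndexOutOfRangeException.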
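For reference, here is a minimal sketch of the whole method with the fix applied. It is a variant rather than the exact code above: instead of the break, the inner loop checks i >= 0 before indexing, which has the same effect. The class name ByteArraySearch is a hypothetical wrapper, added only because extension methods need a static class.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical wrapper class; extension methods must live in a static class.
public static class ByteArraySearch
{
    public static List<long> IndexesOf(this byte[] value, byte[] pattern)
    {
        if (value == null)
            throw new ArgumentNullException(nameof(value));
        if (pattern == null)
            throw new ArgumentNullException(nameof(pattern));

        var indexes = new List<long>();
        long valueLength = value.LongLength;
        long patternLength = pattern.LongLength;
        if (valueLength == 0 || patternLength == 0 || patternLength > valueLength)
            return indexes;

        // Bad-character table: default shift is the full pattern length.
        long[] badCharacters = new long[256];
        for (int b = 0; b < 256; ++b)
            badCharacters[b] = patternLength;

        // Every pattern byte except the last gets its distance to the end.
        long lastPatternByte = patternLength - 1;
        for (long p = 0; p < lastPatternByte; ++p)
            badCharacters[pattern[p]] = lastPatternByte - p;

        long index = 0;
        while (index <= valueLength - patternLength)
        {
            // Compare right to left; stop at the first mismatch or when
            // the whole pattern has been checked (i drops below 0).
            long i = lastPatternByte;
            while (i >= 0 && value[index + i] == pattern[i])
                --i;

            if (i < 0)
                indexes.Add(index);

            // The shift is always at least 1, so overlapping matches are kept.
            index += badCharacters[value[index + lastPatternByte]];
        }
        return indexes;
    }
}
```

For example, searching the bytes { 1, 2, 1, 2, 1 } for the pattern { 1, 2, 1 } yields the overlapping matches at indexes 0 and 2.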