regular expressions: quantifying a non-capturing group

Question

See here for some background on what I'm trying to do. In short, I want to match any paths under a /path/foo/ , unless the leaf directory (not the leaf file ), is script .

There's some answers in that quesiton that seem to work, but I'm trying to figure out why a certain solution I attempted did NOT work. The regex is this:

^/path/foo(?:/[^/]+)*(?!/script)/[^/]*$

My admittedly limited understanding of this is the following:

the literal string /path/foo
any number of occurrences of the submatch /[^/]+ . Basically, 0 or more repeated patterns of / followed by some directory name (I'm aware of the issues with spaces or special characters in file paths. I'm ignoring that for now)
NOT the literal string /script . So if, after however many repeated folders from (2), the next thing is /script , it fails, assuming it is then followed by...
a literal /
1 or more non- / characters, followed by the end of the string.

However, this doesn't work . It seems to match everything that starts with /path/foo .

What's wrong with this regex?

Answer 1

Consider input:

/path/foo/a/b/script/file

Regex matches as follows:

^                 Ok: No text before here
/path/foo         "/path/foo"
(?:/[^/]+)*       "/a/b/script"
(?!/script)       Ok: Text after here is "/file"
/                 "/"
[^/]*             "file"
$                 Ok: No text after here

What you wanted is a negative lookbehind, not a negative lookahead:

^                 Ok: No text before here
/path/foo         "/path/foo"
(?:/[^/]+)*       "/a/b/script"
(?<!/script)      Fail: Text before here is "/script"
/                 "/"
[^/]*             "file"
$                 Ok: No text after here

regular expressions: quantifying a non-capturing group

Question

1 answers

solution1
3 2018-07-19 17:42:34

regular expressions: quantifying a non-capturing group

Question

1 answers

solution1 3 2018-07-19 17:42:34

solution1
3 2018-07-19 17:42:34