正则表达式：匹配除特定模式以外的所有内容

Question

I need a regular expression able to match everything but a string starting with a specific pattern (specifically index.php and what follows, like index.php?id=2342343 ).我需要一个能够匹配除以特定模式开头的字符串以外的所有内容的正则表达式（特别是index.php以及后面的内容，例如index.php?id=2342343 ）。

Answer 1

Regex: match everything but :正则表达式：匹配所有内容，但：

a string starting with a specific pattern (eg any - empty, too - string not starting with foo ):以特定模式开头的字符串（例如，任何 - 也为空 - 不以foo开头的字符串）：
- Lookahead-based solution for NFAs: NFA 基于前瞻的解决方案：
  - ^(?!foo).*$
  - ^(?!foo)
Negated character class based solution for regex engines not supporting lookarounds :不支持环视的正则表达式引擎的基于否定字符类的解决方案：
- ^(([^f].{2}|.[^o].|.{2}[^o]).*|.{0,2})$
- ^([^f].{2}|.[^o].|.{2}[^o])|^.{0,2}$
a string ending with a specific pattern (say, no world. at the end):以特定模式结尾的字符串（例如，没有world.最后）：
- Lookbehind-based solution:基于Lookbehind的解决方案：
  - (?<!world\\.)$
  - ^.*(?<!world\\.)$
- Lookahead solution:前瞻解决方案：
  - ^(?!.*world\\.$).*
  - ^(?!.*world\\.$)
- POSIX workaround: POSIX 解决方法：
  - ^(.*([^w].{5}|.[^o].{4}|.{2}[^r].{3}|.{3}[^l].{2}|.{4}[^d].|.{5}[^.])|.{0,5})$
  - ([^w].{5}|.[^o].{4}|.{2}[^r].{3}|.{3}[^l].{2}|.{4}[^d].|.{5}[^.]$|^.{0,5})$
a string containing specific text (say, not match a string having foo ):包含特定文本的字符串（例如，不匹配具有foo的字符串）：
- Lookaround-based solution:基于环视的解决方案：
  - ^(?!.*foo)
  - ^(?!.*foo).*$
- POSIX workaround: POSIX 解决方法：
  - Use the online regex generator at www.formauri.es/personal/pgimeno/misc/non-match-regex使用www.formauri.es/personal/pgimeno/misc/non-match-regex上的在线正则表达式生成器
a string containing specific character (say, avoid matching a string having a | symbol):包含特定字符的字符串（例如，避免匹配具有|符号的字符串）：
- ^[^|]*$
a string equal to some string (say, not equal to foo ):等于某个字符串的字符串（例如，不等于foo ）：
- Lookaround-based:基于环视：
  - ^(?!foo$)
  - ^(?!foo$).*$
- POSIX: POSIX：
  - ^(.{0,2}|.{4,}|[^f]..|.[^o].|..[^o])$
a sequence of characters :一个字符序列：
- PCRE (match any text but cat ): /cat(*SKIP)(*FAIL)|[^c]*(?:c(?!at)[^c]*)*/i or /cat(*SKIP)(*FAIL)|(?:(?!cat).)+/is PCRE （匹配除cat之外的任何文本）： /cat(*SKIP)(*FAIL)|[^c]*(?:c(?!at)[^c]*)*/i或/cat(*SKIP)(*FAIL)|(?:(?!cat).)+/is
- Other engines allowing lookarounds: (cat)|[^c]*(?:c(?!at)[^c]*)* (or (?s)(cat)|(?:(?!cat).)* , or (cat)|[^c]+(?:c(?!at)[^c]*)*|(?:c(?!at)[^c]*)+[^c]* ) and then check with language means: if Group 1 matched, it is not what we need, else, grab the match value if not empty其他允许环视的引擎： (cat)|[^c]*(?:c(?!at)[^c]*)* （或(?s)(cat)|(?:(?!cat).)*或(cat)|[^c]+(?:c(?!at)[^c]*)*|(?:c(?!at)[^c]*)+[^c]* ) 然后检查语言意思是：如果第1组匹配，它不是我们需要的，否则，如果不为空，则获取匹配值
a certain single character or a set of characters :某个单个字符或一组字符：
- Use a negated character class : [^az]+ (any char other than a lowercase ASCII letter)使用否定字符类： [^az]+ （除小写 ASCII 字母以外的任何字符）
- Matching any char(s) but |匹配任何字符，但| : [^|]+ : [^|]+

Demo note : the newline \\n is used inside negated character classes in demos to avoid match overflow to the neighboring line(s).演示说明：换行符\\n用于演示中的否定字符类中，以避免匹配溢出到相邻行。 They are not necessary when testing individual strings.在测试单个字符串时，它们不是必需的。

Anchor note : In many languages, use \\A to define the unambiguous start of string, and \\z (in Python, it is \\Z , in JavaScript, $ is OK) to define the very end of the string.锚注：在许多语言中，使用\\A来定义字符串的明确开头，并使用\\z （在 Python 中是\\Z ，在 JavaScript 中， $可以）定义字符串的最后。

Dot note : In many flavors (but not POSIX, TRE, TCL), .点注：在许多口味中（但不是 POSIX、TRE、TCL） . matches any char but a newline char.匹配除换行符以外的任何字符。 Make sure you use a corresponding DOTALL modifier ( /s in PCRE/Boost/.NET/Python/Java and /m in Ruby) for the . /m . /s to match any char including a newline.匹配任何字符，包括换行符。

Backslash note : In languages where you have to declare patterns with C strings allowing escape sequences (like \\n for a newline), you need to double the backslashes escaping special characters so that the engine could treat them as literal characters (eg in Java, world\\. will be declared as "world\\\\." , or use a character class: "world[.]" ).反斜杠注意：在必须使用允许转义序列的 C 字符串声明模式的语言中（例如\\n用于换行符），您需要将反斜杠加倍以转义特殊字符，以便引擎可以将它们视为文字字符（例如在 Java 中， world\\.将被声明为"world\\\\." ，或使用字符类： "world[.]" ）。 Use raw string literals (Python r'\\bworld\\b' ), C# verbatim string literals @"world\\."使用原始字符串文字 (Python r'\\bworld\\b' )、C# 逐字字符串文字@"world\\." , or slashy strings/regex literal notations like /world\\./ . ，或像/world\\./这样的斜线字符串/正则表达式文字符号。

Answer 2

不是正则表达式专家，但我认为您可以从一开始就使用负前瞻，例如^(?!foo).*$不应匹配以foo开头的任何内容。

Answer 3

You can put a ^ in the beginning of a character set to match anything but those characters.您可以将^放在字符集的开头以匹配除这些字符之外的任何内容。

[^=]*

will match everything but =将匹配所有内容，但=

Answer 4

只需匹配/^index\\.php/然后拒绝匹配它的任何内容。

Answer 5

In python:在蟒蛇中：

>>> import re
>>> p='^(?!index\.php\?[0-9]+).*$'
>>> s1='index.php?12345'
>>> re.match(p,s1)
>>> s2='index.html?12345'
>>> re.match(p,s2)
<_sre.SRE_Match object at 0xb7d65fa8>

Answer 6

I need a regex able to match everything but except a string starting with index.php a specific pattern (specifically index.php and what follows, like index.php?id=2342343)我需要一个能够匹配除以index.php开头的字符串以外的所有内容的正则表达式和特定模式（特别是 index.php 以及后面的内容，例如 index.php?id=2342343）

Use method Exec使用方法Exec

 let match, arr = [], myRe = /([\\s\\S]+?)(?:index\\.php\\?id.+)/g; var str = 'http://regular-viragenia/index.php?id=2342343'; while ((match = myRe.exec(str)) != null) { arr.push(match[1]); } console.log(arr);

 var myRe = /([\\s\\S]+?)(?:index\\.php\\?id=.+)/g; var str = 'http://regular-viragenia/index.php?id=2342343'; var matches_array = myRe.exec(str); console.log(matches_array[1]);

OR OTHER MATCH或其他比赛

 let match, arr = [], myRe = /index.php\\?id=((?:(?!index)[\\s\\S])*)/g; var str = 'http://regular-viragenia/index.php?id=2342343index.php?id=111index.php?id=222'; while ((match = myRe.exec(str)) != null) { arr.push(match[1]); } console.log(arr);

Answer 7

I had this problem for multiple search and replace.我在多次搜索和替换时遇到了这个问题。 Needed a negative pattern to skip matching till the next search需要一个否定模式来跳过匹配直到下一次搜索

import re

text = "alex ![image]dfsf(dfd.png) [image]fsdf(dfd.png) home ![image]fdsf(dfd.png) end"
replaced_text = re.sub(r'!\[image\](.*)\(.*\.png\)', '*', text)
print(replaced_text)

gave给了

alex * end

basically, the middle was swallowing till the next .png基本上，中间一直在吞咽直到下一个.png

Used the method https://stackoverflow.com/a/17761124/429476 by Firish and got what I wanted.使用 Firish 的方法https://stackoverflow.com/a/17761124/429476得到了我想要的。 Here the character space is not matched;这里没有匹配到字符空间； and the next words are separated by space并且接下来的单词由空格分隔

replaced_text = re.sub(r'!\[image\]([^ ]*)\([^ ]*\.png\)', '*', text)

and got what I wanted得到了我想要的

alex * [image]fsdf(dfd.png) home * end

Answer 8

grep -v in shell grep -v在外壳中

!~ in perl ！〜在perl中

Please add more in other languages - I marked this as Community Wiki. 请添加其他语言的其他内容-我将此标记为社区Wiki。

Answer 9

How about not using regex:不使用正则表达式怎么样：

// In PHP
0 !== strpos($string, 'index.php')

正则表达式：匹配除特定模式以外的所有内容

问题描述

6 个解决方案

解决方案1
483 2016-06-23 10:12:19

解决方案2
327 已采纳 2009-11-06 13:40:12

解决方案3
302 2013-07-20 10:13:51

解决方案4
5

解决方案5
4 2009-11-06 13:41:23

解决方案6
0 2019-04-19 05:43:26

解决方案7
-1 2023-01-22 10:29:55

解决方案8
-4

解决方案9
-17 2009-11-06 13:50:17

正则表达式：匹配除特定模式以外的所有内容

问题描述

6 个解决方案

解决方案1 483 2016-06-23 10:12:19

解决方案2 327 已采纳 2009-11-06 13:40:12

解决方案3 302 2013-07-20 10:13:51

解决方案4 5

解决方案5 4 2009-11-06 13:41:23

解决方案6 0 2019-04-19 05:43:26

解决方案7 -1 2023-01-22 10:29:55

解决方案8 -4

解决方案9 -17 2009-11-06 13:50:17

解决方案1
483 2016-06-23 10:12:19

解决方案2
327 已采纳 2009-11-06 13:40:12

解决方案3
302 2013-07-20 10:13:51

解决方案4
5

解决方案5
4 2009-11-06 13:41:23

解决方案6
0 2019-04-19 05:43:26

解决方案7
-1 2023-01-22 10:29:55

解决方案8
-4

解决方案9
-17 2009-11-06 13:50:17