Python Regex - 替换不在两个特定单词之间的字符串

Question

Given a string, I need to replace a substring with another in an area not located between two given words. 给定一个字符串，我需要在不在两个给定单词之间的区域中替换另一个子字符串。

For example: 例如：

substring: "ate" replace to "drank", 1st word - "wolf", 2nd word - "chicken"

input:  The wolf ate the chicken and ate the rooster
output: The wolf ate the chicken and drank the rooster

Currently, the only solution I have is extremely unclean: 目前，我唯一的解决方案是非常不洁净：

1) Replace the string located between the two words to a temporary substring, via Replace a string located between 1）通过替换位于其间的字符串，将位于两个单词之间的字符串替换为临时子字符串

2) replace the string I originally wanted 2）替换我原来想要的字符串

3) revert the temporary string to the original string 3）将临时字符串还原为原始字符串

Edit: 编辑：

I specifically asked a slightly different question than my case to keep the answer relevant for future readers. 我特别提出了一个与我的案例略有不同的问题，以保持答案与未来的读者相关。

My specific need is splitting a string according to ":", when I need to disregard ":" that are between "<" and ">" brackets that can be chained, where the only promise is that the number of opening brackets equal the number of closing brackets. 我特别需要根据“：”拆分一个字符串，当我需要忽略“<”和“>”括号之间可以链接的“：”时，唯一的承诺是开口括号的数量等于关闭括号的数量。

So for example, In the following case: 例如，在以下情况中：

input  a : <<a : b> c> : <a < a < b : b> : b> : b> : a
output [a, <<a : b> c>, <a < a < b : b> : b> : b>, a]

If the answers are very different, I'll start another question. 如果答案非常不同，我会提出另一个问题。

Answer 1

def repl(match):
    if match.group()=="ate":
        return "drank"
    return  match.group()


x="The wolf ate the chicken and ate the rooster"
print re.sub(r"(wolf.*chicken)|\bate\b",repl,x)

You can use a function for replacement to do the trick with re.sub 您可以使用替换函数来执行re.sub

Answer 2

Use re.sub one-liner function. 使用re.sub单行功能。

>>> s = "The wolf ate the chicken and ate the rooster"
>>> re.sub(r'wolf.*?chicken|\bate\b', lambda m: "drank" if m.group()=="ate" else m.group(), s)
'The wolf ate the chicken and drank the rooster'

Update: 更新：

Updated problem would be solved by using regex module. 使用regex模块可以解决更新的问题。

>>> s = "a : <<a : b> c> : <a < a < b : b> : b> : b> : a"
>>> [i for i in regex.split(r'(<(?:(?R)|[^<>])*>)|\s*:\s*', s) if i]
['a', '<<a : b> c>', '<a < a < b : b> : b> : b>', 'a']

DEMO DEMO

Python Regex - 替换不在两个特定单词之间的字符串

问题描述

2 个解决方案

解决方案1
3 2015-04-16 13:30:18

解决方案2
1 已采纳 2015-04-16 16:45:28

Python Regex - 替换不在两个特定单词之间的字符串

问题描述

2 个解决方案

解决方案1 3 2015-04-16 13:30:18

解决方案2 1 已采纳 2015-04-16 16:45:28

解决方案1
3 2015-04-16 13:30:18

解决方案2
1 已采纳 2015-04-16 16:45:28