Find all occurrences of a regex pattern, but ignore occurrences that contain another pattern

Question

I have a block of text that I'm trying to parse:

「<%sM_item2><%sM_plusnum2>の|　<%sM_slot>の部分を|　<%sM_change_color>に　カラーリングするのですね？|<br>|「それでは　<%sM_item>が　１０本と|　<%nM_gold>ゴールドが必要ですが　よろしいですか？|<yesno><close>

In this block of text, I'm trying to regex split on all occurrences of <???>, EXCEPT when the tag has the form <%???>.

I have it mostly working with this:

re.split(r'<((?!%).+?)>', source_text)

['「<%sM_item2><%sM_plusnum2>の|\u3000<%sM_slot>の部分を|\u3000<%sM_change_color>に\u3000カラーリングするのですね？|', 'br', '|「それでは\u3000<%sM_item>が\u3000１０本と|\u3000<%nM_gold>ゴールドが必要ですが\u3000よろしいですか？|', 'yesno', '', 'close', '']

My problem is that although it kept the <%???> tags in place, it stripped the <> characters from the matches (notice that the 'yesno', 'close', and 'br' tags no longer have those characters).

Answer 1

Based on the documentation of re.split:

Split string by the occurrences of pattern. If capturing parentheses are used 
in pattern, then the text of all groups in the pattern are also returned as 
part of the resulting list.
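
To make the quoted behavior concrete, here is a minimal sketch on a made-up string (the `sample` value and variable names are illustrative, not from the question). It compares no group, a group around the inner text only, and a group around the whole delimiter:

```python
import re

# Hypothetical sample string, used only to illustrate re.split behavior.
sample = "a<x>b<y>c"

# No capturing group: the delimiters are dropped entirely.
without_group = re.split(r"<.+?>", sample)

# Group around the inner text only: the text between < and > is returned,
# but the angle brackets themselves are lost.
inner_group = re.split(r"<(.+?)>", sample)

# Group around the whole delimiter: the angle brackets are kept.
outer_group = re.split(r"(<.+?>)", sample)

print(without_group)  # ['a', 'b', 'c']
print(inner_group)    # ['a', 'x', 'b', 'y', 'c']
print(outer_group)    # ['a', '<x>', 'b', '<y>', 'c']
```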

In this case, the capturing parentheses need to be placed around the whole match, so that the <> characters fall inside the group and are preserved.

re.split('(<(?!%).+?>)', source_text)
['「<%sM_item2><%sM_plusnum2>の|\u3000<%sM_slot>の部分を|\u3000<%sM_change_color>に\u3000カラーリングするのですね？|', '<br>', '|「それでは\u3000<%sM_item>が\u3000１０本と|\u3000<%nM_gold>ゴールドが必要ですが\u3000よろしいですか？|', '<yesno>', '', '<close>', '']
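
The result above still contains empty strings wherever two tags are adjacent or a tag ends the text. As a possible follow-up, the sketch below drops those empty pieces. Two things here are my own assumptions rather than part of the answer: the name `TAG`, and the use of `[^<>]+` instead of `.+?` so that a match cannot run across another tag or a newline. On this input both patterns match the same tags.

```python
import re

# The text from the question, split over two literals for readability.
source_text = (
    "「<%sM_item2><%sM_plusnum2>の|　<%sM_slot>の部分を|　<%sM_change_color>に　カラーリングするのですね？|"
    "<br>|「それでは　<%sM_item>が　１０本と|　<%nM_gold>ゴールドが必要ですが　よろしいですか？|<yesno><close>"
)

# Assumed variant of the answer's pattern: the outer group keeps the angle
# brackets, (?!%) skips <%...> tags, and [^<>]+ keeps a match inside one tag.
TAG = re.compile(r"(<(?!%)[^<>]+>)")

# Split, then discard the empty strings produced by adjacent or trailing tags.
parts = [piece for piece in TAG.split(source_text) if piece]

for piece in parts:
    print(repr(piece))
```

Because the group wraps the whole delimiter, joining `parts` back together reproduces the original text, which is a quick way to check that nothing was lost.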

Answer 1
0 2021-11-22 03:42:25
