Java中基于正则表达式的字符串拆分

Question

String delimiterRegexp = "(;|:|[^<]/)";
String value = "get/time/pick me <i>Jack</i>";
String[] splitedTexts = value.split(delimiterRegexp);
for (String text : splitedTexts) {
System.out.println(text);
}

Output:
ge
tim
pick me <i>Jack</i>

Expected Result: 
get
time
pick me <i>Jack</i>

A character is getting added as delimeter along with /. 字符将与/一起作为分隔符添加。 Could anyone help me out to write regex to split text based on delimeter"/" and it should ignore xml end tag" 任何人都可以帮我写正则表达式以基于分隔符“ /”分割文本，并且它应该忽略xml结束标记“

Answer 1

Your regex should be like this: 您的正则表达式应如下所示：

(;|:|(?<!<)/)

with a negative lookbehind, demo: https://regex101.com/r/2k1WI5/1/ 后面带有负面效果的演示： https ：//regex101.com/r/2k1WI5/1/

Your current regex [^<]/ will match basically any character that is not < followed by / even \\n , space, and Japanese characters. 您当前的正则表达式[^<]/基本上将匹配所有非<后跟/甚至\\n ，空格和日语字符的字符。

That's why you are losing some letters as they are considered as part of the separator. 这就是为什么您会丢失一些字母，因为它们被视为分隔符的一部分。

Following The fourth bird recommendation, you can even simplify the regex into: ([;:]|(?<!<)/) 按照第四个鸟的建议，您甚至可以将正则表达式简化为： ([;:]|(?<!<)/) ;： ([;:]|(?<!<)/)

Answer 2

[^<]/ will match e/ and t/ [^<]/将匹配e/和t/

use a lookbehind instead, it will have the wanted behaviour to only consider / as separator if it's not a closing tag 改用lookbehind，如果不是结束标记，它将只具有将/视为分隔符的期望行为

On regex101.com 在regex101.com上

(?<!<)/

The whole regex 整个正则表达式

(;|:|(?<!<)/)

Java中基于正则表达式的字符串拆分

问题描述

2 个解决方案

解决方案1
4 2019-03-18 12:41:39

解决方案2
3 已采纳 2019-03-18 12:39:39

Java中基于正则表达式的字符串拆分

问题描述

2 个解决方案

解决方案1 4 2019-03-18 12:41:39

解决方案2 3 已采纳 2019-03-18 12:39:39

解决方案1
4 2019-03-18 12:41:39

解决方案2
3 已采纳 2019-03-18 12:39:39