How do I remove specific repeated characters from text?

Question

I have a string like

"this is line 1\n\n\nthis is line 2\n\n\nthis is line 3\t\t\tthis is line 3 also"

What I want to do is remove repeated occurrences of specific characters, such as "\n" and "\t", from this text, so that the result is:

"this is line 1\nthis is line 2\nthis is line 3\tthis is line 3 also"

I tried some regular expressions, but they did not work for me.

text = text.replace("/[^\\w\\s]|(.)\\1/gi", "");

Is there a regex for this?

Answer 1

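If you only need to collapse runs of specific whitespace characters, \s won't help because it overmatches, i.e. it also matches regular spaces and, depending on the regex flags, hard spaces, etc.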
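To illustrate the overmatching, here is a minimal sketch; the class and method names (OvermatchDemo, collapseAnyWhitespace) are made up for this example. It applies the same backreference idea with \s and shows that runs of ordinary spaces get collapsed as well, which the question did not ask for:

```java
public class OvermatchDemo {
    // Hypothetical helper: collapses runs of the same whitespace character.
    // \s is too broad here because it also covers the plain space.
    static String collapseAnyWhitespace(String text) {
        return text.replaceAll("(\\s)\\1+", "$1");
    }

    public static void main(String[] args) {
        String input = "a  b\n\n\nc";
        // The double space between "a" and "b" is collapsed too.
        System.out.println(collapseAnyWhitespace(input).equals("a b\nc"));
    }
}
```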

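You can use a character class containing the characters you care about, wrap it in a capturing group, and follow it with a backreference to the captured value. Then replace the match with a reference to the Group 1 value: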

.replaceAll("([\n\t])\\1+", "$1")

See the regex demo.

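Details

([\n\t]) - Capturing group 1: a single newline or tab character

\\1+ - one or more repetitions of the same character captured in Group 1 (\1 in the regex, written as \\1 in a Java string literal)

$1 - in the replacement, the Group 1 value, so each run is reduced to a single character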
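As a self-contained sketch of this approach applied to the string from the question (the class name CollapseRepeats and the method collapse are hypothetical, chosen only for this example):

```java
public class CollapseRepeats {
    // Replaces each run of identical newlines or identical tabs
    // with a single occurrence of that character.
    static String collapse(String text) {
        return text.replaceAll("([\n\t])\\1+", "$1");
    }

    public static void main(String[] args) {
        String text = "this is line 1\n\n\nthis is line 2\n\n\nthis is line 3\t\t\tthis is line 3 also";
        String expected = "this is line 1\nthis is line 2\nthis is line 3\tthis is line 3 also";
        // Compare against the output the question asks for.
        System.out.println(collapse(text).equals(expected));
    }
}
```

Because the backreference requires the same character to repeat, a mixed sequence such as "\n\t\n" is left unchanged, and spaces are never touched.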

Answer 2

CharMatcher.javaIsoControl().removeFrom(myString)