用Java正則表達式拆分

Question

我有一個像這樣的字符串：

Snt:It was the most widespread day of environmental action in the planet's history
====================
-----------
Snt:Five years ago, I was working for just over minimum wage
====================
-----------

我想用

====================
-----------

當然從句子的第一句中刪除Snt: 什么是最好的方法？

我用了這個正則表達式，但是沒用！

String[] content1 =content.split("\\n\\====================\\n\\-----------\\n");

提前致謝。

Answer 1

關於什么

Pattern p = Pattern.compile("^Snt:(.*)$", Pattern.MULTILINE);
Matcher m = p.matcher(str);

while (m.find()) {
    String sentence = m.group(1);
}

而不是黑客各地的split ，做額外的解析，這只是看起來與“SNT”，然后捕獲任何如下開始的行。

Answer 2

由於數據的結構方式，我將把拆分的概念顛倒過來，成為匹配器。，這也使您可以很好地對Snt進行數學計算：

private static final String VAL = "Snt:It was the most widespread day of environmental action in the planet's history\n"
        + "====================\n"
        + "-----------\n"
        + "Snt:Five years ago, I was working for just over minimum wage\n"
        + "====================\n"
        + "-----------";

public static void main(String[] args) {
    List<String> phrases = new ArrayList<String>();
    Matcher mat = Pattern.compile("Snt:(.+?)\n={20}\n-{11}\\s*").matcher(VAL);
    while (mat.find()) {
        phrases.add(mat.group(1));
    }

    System.out.printf("Value: %s%n", phrases); 
}

我使用正則表達式： "Snt:(.+?)\\n={20}\\n-{11}\\\\s*"

假設文件中的第一個單詞是Snt:然后將下一個短語分組，直到定界符為止。 它將占用任何結尾的空格，使表達式為下一條記錄做好准備。

此過程的好處是，匹配項匹配單個記錄，而不是具有與一個記錄的結尾部分（也許是下一個記錄的開頭）部分匹配的表達式。

Answer 3

由於最后沒有換行符，因此它將不匹配最后的== ， --行。 您需要在最后添加行錨$的末尾，以替代正則表達式中\\n 。

String s = "Snt:It was the most widespread day of environmental action in the planet's history\n" +
"====================\n" +
"-----------\n" +
"Snt:Five years ago, I was working for just over minimum wage\n" +
"====================\n" +
"-----------";
String m = s.replaceAll("(?m)^Snt:", "");
String[] tok = m.split("\\n\\====================\\n\\-----------(?:\\n|$)");
System.out.println(Arrays.toString(tok));

輸出：

[It was the most widespread day of environmental action in the planet's history, Five years ago, I was working for just over minimum wage]

Answer 4

Matcher m = Pattern.compile("([^=\\-]+)([=\\-]+[\\t\\n\\s]*)+").matcher(str);   
while (m.find()) {
    String match = m.group(1);
    System.out.println(match);
}

用Java正則表達式拆分

問題描述

4 個解決方案

解決方案1
3 2014-10-03 17:05:08

解決方案2
2 2014-10-03 17:02:35

解決方案3
1 2014-10-03 16:55:54

解決方案4
0 2014-10-06 07:22:29

用Java正則表達式拆分

問題描述

4 個解決方案

解決方案1 3 2014-10-03 17:05:08

解決方案2 2 2014-10-03 17:02:35

解決方案3 1 2014-10-03 16:55:54

解決方案4 0 2014-10-06 07:22:29

解決方案1
3 2014-10-03 17:05:08

解決方案2
2 2014-10-03 17:02:35

解決方案3
1 2014-10-03 16:55:54

解決方案4
0 2014-10-06 07:22:29