简体   繁体   English

使用正则表达式解析log4j日志文件

[英]parsing log4j log file using regular expression

I have created a java application for parsing the log4j log file using regular expression, The application is working fine for the log which i have shown below 我创建了一个用于使用正则表达式解析log4j日志文件的Java应用程序,该应用程序对于下面显示的日志工作正常

1999-11-27 15:49:37,459 [thread-x] ERROR mypackage - Catastrophic system failure

but not working for 但不为

2015-01-22 01:52:54,237 [http-bio-80-exec-5] FATAL   TestLog4jServlet - Show FATAL message

My log4j ConversionPattern is given below 我的log4j ConversionPattern如下

log4j.appender.Appender2.layout.ConversionPattern=%d [%t] %-7p %10c{1} - %m%n

Can anyone please tell me some solution for this 谁能告诉我一些解决方案

My code is as given below 我的代码如下

public static void main(String[] args) {
    String regex = "(\\d{4}-\\d{2}-\\d{2}) (\\d{2}:\\d{2}:\\d{2},\\d{3}) \\[(.*)\\] ([^ ]*) ([^ ]*) - (.*)$";

    Pattern p = Pattern.compile(regex);
    String[] samples = {
            "2015-01-22 01:52:54,237 [http-bio-80-exec-5] FATAL   TestLog4jServlet - Show FATAL message"
        };

    Matcher m = p.matcher(samples[1]);
    System.out.println(m.matches());
    if (m.matches() && m.groupCount() == 6) {
        String date = m.group(1);
        String time = m.group(2);
        String threadId = m.group(3);
        String priority = m.group(4);
        String category = m.group(5);
        String message = m.group(6);

        System.out.println("date: " + date);
        System.out.println("time: " + time);
        System.out.println("threadId: " + threadId);
        System.out.println("priority: " + priority);
        System.out.println("category: " + category);
        System.out.println("message: " + message);
    }
}

Because there are two spaces between FATAL and TestLog4jServlet but you included only one space in your regex. 因为FATALTestLog4jServlet之间有两个空格,但是您的正则表达式中只包含一个空格。 So i suggest you to replace the corresponding space with <space>+ which allows one or more spaces. 因此,我建议您用允许一个或多个空格的<space>+替换相应的空格。

(\d{4}-\d{2}-\d{2}) (\d{2}:\d{2}:\d{2},\d{3}) \[(.*?)\] ([^ ]*) +([^ ]*) - (.*)$
                                                                ^
                                                                |

DEMO 演示

Java regex would be, Java正则表达式将是

"(\\d{4}-\\d{2}-\\d{2}) (\\d{2}:\\d{2}:\\d{2},\\d{3}) \\[(.*)\\] ([^ ]*) +([^ ]*) - (.*)$"

我认为Logstash更适合解析日志。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM