繁体   English   中英

正则表达式匹配以逗号分隔的字符串中的元素,忽略双引号、大括号和方括号内的逗号

[英]RegEx to match elements in a string separated by comma ignoring comma inside double quotes, curly brackets and square brackets

我有一个string值如下

String str = {"A":"do not seperate,value","B":"OPM","C":[1,2,"AB",{"1":{"1":2,"2":[1,2]},"2":2}],"D":{"1":1,"2":[{"1":1,"2":2,"3":3},9,10]}};

我如何编写一个正则表达式来捕获由不在双引号、方括号或大括号内的逗号分隔的元素? 我想通过执行以下操作来匹配并获取元素; 使用模式匹配。

List<String> list = new ArrayList<>();
Pattern pattern = Pattern.compile(regex);
Matcher matcher = pattern.matcher(str);
while (matcher.find()) {
    list.add(matcher.group());
}

元素应该是,

"A":"do not seperate,value"
"B":"OPM"
"C":[1,2,"AB",{"1":{"1":2,"2":[1,2]},"2":2}]
"D":{"1":1,"2":[{"1":1,"2":2,"3":3},9,10]}

如果字符串如下所示

String str = [1,2,{"A":1,"B":2},[19,10,11,{"A":1,"B":2}],100]

那么元素应该是

1
2
{"A":1,"B":2}
[19,10,11,{"A":1,"B":2}]
100

由于它是一个 json 字符串,您可以使用对象映射器解析它并将它们作为哈希图中的键值对获取。

import com.fasterxml.jackson.core.type.TypeReference;
import com.fasterxml.jackson.databind.ObjectMapper;
import java.io.IOException;
import java.util.*;
import org.json.JSONObject;

public class Test {
    public static void main(String args[]) throws IOException {
        String str = "{\"A\":\"do not seperate,value\",\"B\":\"OPM\",\"C\":[1,2,\"AB\",{\"1\":{\"1\":2,\"2\":[1,2]},\"2\":2}],\"D\":{\"1\":1,\"2\":[{\"1\":1,\"2\":2,\"3\":3},9,10]}}";
        JSONObject obj = new JSONObject(str);
        HashMap<String, Object> result = new ObjectMapper().readValue(str, new TypeReference<Map<String, Object>>() {
        });
        result.entrySet().stream().forEach(System.out::println);
    }
}

输出

A=do not seperate,value
B=OPM
C=[1, 2, AB, {1={1=2, 2=[1, 2]}, 2=2}]
D={1=1, 2=[{1=1, 2=2, 3=3}, 9, 10]}

你可能可以做这样的事情

public static List<String> getStringElements(String str) {
    List<String> elementsList = new ArrayList<>();
    StringBuilder element = new StringBuilder();
    int bracketsCount = 0;
    int quotesCount = 0;
    char[] strChars = str.substring(1, str.length() - 1).toCharArray();
    for (char strChar : strChars) {
        element.append(strChar);
        if (strChar == '\"') {
            quotesCount++;
        } else if (strChar == '[' && quotesCount % 2 == 0) {
            bracketsCount++;
        } else if (strChar == '{' && quotesCount % 2 == 0) {
            bracketsCount++;
        } else if (strChar == '(' && quotesCount % 2 == 0) {
            bracketsCount++;
        } else if (strChar == ']' && quotesCount % 2 == 0) {
            bracketsCount--;
        } else if (strChar == '}' && quotesCount % 2 == 0) {
            bracketsCount--;
        } else if (strChar == ')' && quotesCount % 2 == 0) {
            bracketsCount--;
        } else if (strChar == ',' && bracketsCount == 0 && quotesCount % 2 == 0) {
            elementsList.add(element.substring(0, element.length() - 1));
            element = new StringBuilder();
        }
    }
    if (element.length() > 0) {
        elementsList.add(element.toString());
    }
    return elementsList;
}
    

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM