
Apache Commons CSV : Read Values with comma

I am converting CSV files to Java beans, and I need to preserve commas inside values that are enclosed in double quotes ("").

Here is my code.

public static PPRCV convertContestToObj(String fileName) throws IOException {

    PPRCV pprcvHandler = PPRCVFactory.getPPRCVTable(fileName);

    CSVFormat csvFileFormat = CSVFormat.DEFAULT.newFormat(',').withEscape('"');

    List<PPRCV> pprcvs = new ArrayList<>();
    FileReader fileReader = new FileReader(fileName);

    CSVParser csvFileParser = new CSVParser(fileReader, csvFileFormat);

    List<CSVRecord> csvRecords = csvFileParser.getRecords();

    for (CSVRecord csvRecord : csvRecords) {
        pprcvs.add(pprcvHandler.populateDynamicDetails(csvRecord));
    }

    return pprcvHandler;

}

Sample CSV line:

7080001, XI, ProvinceX, TownX, BRGX, "SHOOL, BRGX", "0054A,0055A,0055B,0055C"

My DTO:

private String precintCode;

private String regionName;

private String provinceName;

private String municipalityName;

private String districtName;

private String votingCenter;

private String precint;

The expected output is:

precintCode = "7080001"

regionName = "XI"

provinceName = "ProvinceX"

municipalityName = "TownX"

districtName = "BRGX"

votingCenter = "SCHOOL, BRGX"

precint = "0054A,0055A,0055B,0055C"

However, the actual output is:

precintCode = "7080001"

regionName = "XI"

provinceName = "ProvinceX"

municipalityName = "TownX"

districtName = "BRGX"

votingCenter = ""SCHOOL"

precint = " , BRGX,"0054A"

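You need the withIgnoreSurroundingSpaces() option here. All other settings can remain at DEFAULT. Every field in the sample line is preceded by a space, so the opening quote is not the first character of the field and is not recognized as a quote unless surrounding spaces are ignored. Note also that newFormat(',') creates a format with no quote character at all, and withEscape('"') makes the double quote an escape character rather than a quote character.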

    final Reader in = new StringReader("7080001, XI, ProvinceX, TownX, BRGX, \"SHOOL, BRGX\", \"0054A,0055A,0055B,0055C\" ");
    final CSVFormat csvFileFormat = CSVFormat.DEFAULT.withIgnoreSurroundingSpaces();

    for (CSVRecord record: csvFileFormat.parse(in)) {
        for (String field: record) {
            System.out.println("\"" + field + "\"");
        }
        System.out.println();
    }

The output is

"7080001"
"XI"
"ProvinceX"
"TownX"
"BRGX"
"SHOOL, BRGX"
"0054A,0055A,0055B,0055C"

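To make the reason behind that answer concrete, here is a minimal stdlib-only sketch of a comma splitter that honours quotes only when the quote is the first character of a field. It is an illustration, not the Commons CSV implementation: the class QuoteAwareSplitDemo and its split method are hypothetical names, and the parsing rules are simplified (single line, comma delimiter, double-quote character only).

```java
import java.util.ArrayList;
import java.util.List;

// Illustration only: a hand-rolled splitter, NOT the Commons CSV implementation.
class QuoteAwareSplitDemo {

    // Hypothetical helper: splits one line on commas. A field counts as quoted
    // only if its first character is '"', optionally after skipping leading spaces.
    static List<String> split(String line, boolean ignoreSurroundingSpaces) {
        List<String> fields = new ArrayList<>();
        int n = line.length();
        int i = 0;
        while (true) {
            if (ignoreSurroundingSpaces) {
                while (i < n && line.charAt(i) == ' ') {
                    i++; // skip spaces in front of the field
                }
            }
            StringBuilder field = new StringBuilder();
            if (i < n && line.charAt(i) == '"') {
                i++; // consume the opening quote
                while (i < n) {
                    char c = line.charAt(i);
                    if (c == '"' && i + 1 < n && line.charAt(i + 1) == '"') {
                        field.append('"'); // doubled quote is a literal quote
                        i += 2;
                    } else if (c == '"') {
                        i++; // closing quote
                        break;
                    } else {
                        field.append(c); // commas inside quotes are kept
                        i++;
                    }
                }
                while (i < n && line.charAt(i) != ',') {
                    i++; // discard anything between the closing quote and the delimiter
                }
                fields.add(field.toString());
            } else {
                // Unquoted field: every comma ends it, and quotes are literal text.
                while (i < n && line.charAt(i) != ',') {
                    field.append(line.charAt(i));
                    i++;
                }
                String value = field.toString();
                fields.add(ignoreSurroundingSpaces ? value.trim() : value);
            }
            if (i >= n) {
                break;
            }
            i++; // skip the delimiter
        }
        return fields;
    }

    public static void main(String[] args) {
        String line = "7080001, XI, ProvinceX, TownX, BRGX, \"SHOOL, BRGX\", \"0054A,0055A,0055B,0055C\"";
        System.out.println(split(line, false));
        System.out.println(split(line, true));
    }
}
```

In this sketch, split(line, false) breaks the sample line into 11 fragments with stray quotes, because each quote follows a space. split(line, true) returns the 7 expected fields, with "SHOOL, BRGX" and "0054A,0055A,0055B,0055C" kept intact.
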
I was able to do this with the library's withQuote function.

CSVFormat.EXCEL.newFormat(',').withQuote('"')

Have you already tried the CSVFormat.DEFAULT constant? It is intended for CSV files that follow RFC 4180.

The following worked for me:

CSVFormat.EXCEL.withQuote('"')
