繁体   English   中英

Spring Jaxb2:如何在不读取内存的情况下将批处理数据追加到XML文件?

[英]Spring Jaxb2: How to append batch data to XML file with no reading it to memory?

我需要将数据批量写入xml。

有以下域对象:

@XmlRootElement(name = "country")
public class Country {
    @XmlElements({@XmlElement(name = "town", type = Town.class)})
    private Collection<Town> towns = new ArrayList<>();
    ....
}

和:

@XmlRootElement(name = "town")
public class Town {
    @XmlElement
    private String townName;
    // etc
}

我正在用Jaxb2编组对象。 配置如下:

marshaller = new Jaxb2Marshaller();
marshaller.setClassesToBeBound(Country.class, Town.class);

因为简单的封送在这里不能作为marhaller.marshall(fileName, country) -它使xml格式错误。

有没有一种方法可以让tweek marhaller生成一个文件,如果它不与所有已封装的数据一起存在,或者仅将其附加在xml文件的末尾,则可以创建文件?

另外,由于此文件可能很大,因此我不想读取内存中的整个文件,不要追加数据,然后再写入磁盘。

我已经使用StAX进行xml处理,因为它基于流,比DOM消耗更少的内存,并且与只能解析xml数据但不能写入xml的SAX相比,具有读取和写入的能力。

这是我想出的方法:

public enum StAXBatchWriter {
    INSTANCE;
    private static final Logger LOGGER = LoggerFactory.getLogger(StAXBatchWriter.class);

    public void writeUrls(File original, Collection<Town> towns) {
        XMLEventReader eventReader = null;
        XMLEventWriter eventWriter = null;
        try {
            String originalPath = original.getPath();
            File from = new File(original.getParent() + "/old-" + original.getName());
            boolean isRenamed = original.renameTo(from);
            if (!isRenamed)
                throw new IllegalStateException("Failed to rename file: " + original.getPath() + " to " + from.getPath());
            File to = new File(originalPath);

            XMLInputFactory inFactory = XMLInputFactory.newInstance();
            eventReader = inFactory.createXMLEventReader(new FileInputStream(from));

            XMLOutputFactory outFactory = XMLOutputFactory.newInstance();
            eventWriter = outFactory.createXMLEventWriter(new FileWriter(to));

            XMLEventFactory eventFactory = XMLEventFactory.newInstance();

            while (eventReader.hasNext()) {
                XMLEvent event = eventReader.nextEvent();
                eventWriter.add(event);
                if (event.getEventType() == XMLEvent.START_ELEMENT && event.asStartElement().getName().toString().contains("country")) {
                    for (Town town : towns) {
                        writeTown(eventWriter, eventFactory, town);
                    }
                }
            }
            boolean isDeleted = from.delete();
            if (!isDeleted)
                throw new IllegalStateException("Failed to delete old file: " + from.getPath());
        } catch (IOException | XMLStreamException e) {
            LOGGER.error(e.getMessage(), e);
            throw new RuntimeException(e);
        } finally {
            try {
                if (eventReader != null)
                    eventReader.close();
            } catch (XMLStreamException e) {
                LOGGER.error(e.getMessage(), e);
            }
            try {
                if (eventWriter != null)
                    eventWriter.close();
            } catch (XMLStreamException e) {
                LOGGER.error(e.getMessage(), e);
            }
        }
    }

    private void writeTown(XMLEventWriter eventWriter, XMLEventFactory eventFactory, Town town) throws XMLStreamException {
        eventWriter.add(eventFactory.createStartElement("", null, "town"));

        // write town id
        eventWriter.add(eventFactory.createStartElement("", null, "id"));
        eventWriter.add(eventFactory.createCharacters(town.getId()));
        eventWriter.add(eventFactory.createEndElement("", null, "id"));

        //write town name
        if (StringUtils.isNotEmpty(town.getName())) {
            eventWriter.add(eventFactory.createStartElement("", null, "name"));
            eventWriter.add(eventFactory.createCharacters(town.getName()));
            eventWriter.add(eventFactory.createEndElement("", null, "name"));
        }

        // write other fields

        eventWriter.add(eventFactory.createEndElement("", null, "town"));
    }
}

这不是最好的方法,尽管它基于流并且可以工作,但它有一些开销。 当添加批次时-必须重新读取旧文件。

可以选择在文件的某个位置附加数据(例如“在4行之后将数据附加到该文件”),这很好,但是似乎无法完成。

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM