简体   繁体   English

Java中的DOM解析器未对UTF-8中的引号进行编码

[英]DOM parser in java not encoding quotes in UTF-8

I am trying to use the code available from this tutorial :http://www.mkyong.com/java/how-to-create-xml-file-in-java-dom/ 我正在尝试使用本教程提供的代码:http://www.mkyong.com/java/how-to-create-xml-file-in-java-dom/

I've pasted the code below as well, the problem it seems to encode all the predef characters <,> and & etc. but not single or double quotes (" and '). I'd really appreciate a fix. Also the code below has an edit to make the resultant xml appear properly formatted 我也粘贴了下面的代码,这个问题似乎是对所有predef字符<,>和&等进行编码的问题,但是没有对单引号或双引号(“和')进行编码。我真的很感谢修复。下面进行了编辑,以使生成的xml格式正确

More specifically: 进一步来说:

     import java.io.File;
   import javax.xml.parsers.DocumentBuilder;
       import javax.xml.parsers.DocumentBuilderFactory;
     import javax.xml.parsers.ParserConfigurationException;
        import javax.xml.transform.Transformer;
      import javax.xml.transform.TransformerException;
    import javax.xml.transform.TransformerFactory;
      import javax.xml.transform.dom.DOMSource;
        import javax.xml.transform.stream.StreamResult;

       import org.w3c.dom.Attr;
   import org.w3c.dom.Document;
   import org.w3c.dom.Element;

 public class WriteXMLFile {

public static void main(String argv[]) {

  try {

    DocumentBuilderFactory docFactory = DocumentBuilderFactory.newInstance();
    DocumentBuilder docBuilder = docFactory.newDocumentBuilder();

    // root elements
    Document doc = docBuilder.newDocument();
    Element rootElement = doc.createElement("company");
    doc.appendChild(rootElement);

    // staff elements
    Element staff = doc.createElement("Staff");
    rootElement.appendChild(staff);

    // set attribute to staff element
    Attr attr = doc.createAttribute("id");
    attr.setValue("1");
    staff.setAttributeNode(attr);

    // shorten way
    // staff.setAttribute("id", "1");

    // firstname elements
    Element firstname = doc.createElement("firstname");
    firstname.appendChild(doc.createTextNode("yong"));
    staff.appendChild(firstname);

    // lastname elements
    Element lastname = doc.createElement("lastname");
    lastname.appendChild(doc.createTextNode("mook kim"));
    staff.appendChild(lastname);

    // nickname elements
    Element nickname = doc.createElement("nickname");
    nickname.appendChild(doc.createTextNode("mkyong"));
    staff.appendChild(nickname);

    // salary elements
    Element salary = doc.createElement("salary");
    salary.appendChild(doc.createTextNode("100000"));
    staff.appendChild(salary);

    // write the content into xml file
    TransformerFactory transformerFactory = TransformerFactory.newInstance();
    Transformer transformer = transformerFactory.newTransformer();
            transformer.setOutputProperty(OutputKeys.INDENT, "yes");
    DOMSource source = new DOMSource(doc);
    StreamResult result = new StreamResult(new File("C:\\file.xml"));

    // Output to console for testing
    // StreamResult result = new StreamResult(System.out);

    transformer.transform(source, result);

    System.out.println("File saved!");

  } catch (ParserConfigurationException pce) {
    pce.printStackTrace();
  } catch (TransformerException tfe) {
    tfe.printStackTrace();
  }
}

} }

I think your code works fine. 我认为您的代码工作正常。 Put a double quote in an attribute value and see what happens. 在属性值中加上双引号,然后看看会发生什么。

Read section 2.4 of the XML specification. 阅读XML规范的2.4节 Production 14 of the grammar 语法制作14

[14]    CharData       ::=      [^<&]* - ([^<&]* ']]>' [^<&]*)

tells you that character data can be any (valid XML) character except '<' and '&' (or the ']]>' sequence). 告诉您字符数据可以是除“ <”和“&”(或“]]>”序列之外的任何(有效XML)字符。 It is not strictly necessary to escape '>', although recommended. 虽然建议不要使用'>'转义。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM