为什么Jsoup在解析HTML元素时会移动它？

Question

Here is my sample input HTML Codes: 这是我的示例输入HTML代码：

<html>
<head>
<object></object>
</head>
<body>
</body>
</html>

Below is the output when parsed using Jsoup: 以下是使用Jsoup解析时的输出：

<html>
 <head> 
 </head>
 <body>
  <object></object>    
 </body>
</html>

Question: Why did Jsoup move the <object> tag from the <head> to the <body> ? 问题：为什么Jsoup为什么将<object>标记从<head>移到<body> ？

Answer 1

This is correct behaviour since <object> must appear inside the body. 这是正确的行为，因为<object> 必须出现在体内。

HTML Tag HTML标签

[...] [...]

Tips and Notes 提示和注意事项

Note: An element must appear inside the element . 注意：元素必须出现在element内部 。 The text between the and is an alternate text, for browsers that do not support this tag. 和之间的文本是备用文本，用于不支持此标记的浏览器。

http://www.w3schools.com/tags/tag_object.asp http://www.w3schools.com/tags/tag_object.asp

If you want the object within the head, you can use the XmlParser instead: 如果要将对象放在头部，则可以使用XmlParser代替：

    final String html = "<html>\n"
            + "<head>\n"
            + "<object></object>\n"
            + "</head>\n"
            + "<body>\n"
            + "</body>\n"
            + "</html>";

    Document doc = Jsoup.parse(html, "", Parser.xmlParser());
    //                                   |<-------------->|

为什么Jsoup在解析HTML元素时会移动它？

问题描述

1 个解决方案

解决方案1
3 2015-03-26 15:30:33

HTML Tag HTML标签

Tips and Notes 提示和注意事项

为什么Jsoup在解析HTML元素时会移动它？

问题描述

1 个解决方案

解决方案1 3 2015-03-26 15:30:33

HTML Tag HTML标签

Tips and Notes 提示和注意事项

解决方案1
3 2015-03-26 15:30:33