How do I get the HTML source code of a page?

Question

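Is there a way to access a page's HTML source code using JavaScript?

I know I can use document.body.innerHTML, but that only contains the code inside the body. I want the full page source, including the head and body tags with their contents and, if possible, the html tag and the doctype as well. Is this possible?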

Answer 1

Use

document.documentElement.outerHTML

or

document.documentElement.innerHTML
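Note that neither property includes the doctype: outerHTML starts at the html tag, and innerHTML starts inside it. A minimal sketch of one way to put the doctype back, assuming the document exposes a standard `doctype` node with `name`, `publicId` and `systemId`. The helper names `doctypeToString` and `getFullSource` are made up for this illustration:

```javascript
// Rebuild the <!DOCTYPE ...> declaration from a DocumentType-like node.
// Returns an empty string when the document has no doctype.
function doctypeToString(doctype) {
  if (!doctype) return "";
  let out = "<!DOCTYPE " + doctype.name;
  if (doctype.publicId) {
    out += ' PUBLIC "' + doctype.publicId + '"';
  } else if (doctype.systemId) {
    out += " SYSTEM";
  }
  if (doctype.systemId) {
    out += ' "' + doctype.systemId + '"';
  }
  return out + ">";
}

// Combine the doctype (if any) with the serialized <html> element.
function getFullSource(doc) {
  const prefix = doctypeToString(doc.doctype);
  return (prefix ? prefix + "\n" : "") + doc.documentElement.outerHTML;
}

// In a browser: const source = getFullSource(document);
```

This still reflects the current DOM (after scripts have run), not the bytes the server originally sent.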

Answer 2

This can be done in a single line with XMLSerializer.

var generatedSource = new XMLSerializer().serializeToString(document);

which gives a string such as:

<!DOCTYPE html><html><head>

<title>html - javascript page source code - Stack Overflow</title>
...
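If the code might run somewhere XMLSerializer is not defined, the call can be guarded. The following is only a sketch: `serializeDocument` is a hypothetical helper name, and the optional second parameter exists purely so that a different serializer can be passed in. When no serializer is available, it falls back to documentElement.outerHTML, which omits the doctype.

```javascript
// Serialize a document with XMLSerializer when available,
// otherwise fall back to the outerHTML of the root element.
function serializeDocument(doc, Serializer) {
  // Use the injected constructor, else the global XMLSerializer if it exists.
  const Ctor =
    Serializer ||
    (typeof XMLSerializer !== "undefined" ? XMLSerializer : null);
  if (Ctor) {
    return new Ctor().serializeToString(doc);
  }
  // Fallback: no doctype in this form.
  return doc.documentElement.outerHTML;
}

// In a browser: const generatedSource = serializeDocument(document);
```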

Answer 3

One way is to re-request the page with XMLHttpRequest; you will then get the whole page verbatim from the web server.
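A rough sketch of what that re-request could look like. The function name `requestPageSource`, the Node-style `onDone(error, text)` callback and the optional `XhrCtor` parameter are assumptions made for this example, not part of any API; in a browser the third argument can be left out.

```javascript
// Re-request a URL and hand the raw response text to a callback.
// onDone receives (error, text); exactly one of them is non-null.
function requestPageSource(url, onDone, XhrCtor) {
  // XhrCtor is only for substituting a different implementation.
  const Ctor = XhrCtor || XMLHttpRequest;
  const xhr = new Ctor();
  xhr.open("GET", url, true); // true = asynchronous
  xhr.onload = function () {
    if (xhr.status >= 200 && xhr.status < 300) {
      onDone(null, xhr.responseText);
    } else {
      onDone(new Error("HTTP " + xhr.status), null);
    }
  };
  xhr.onerror = function () {
    onDone(new Error("Network error"), null);
  };
  xhr.send();
}

// In a browser:
// requestPageSource(document.location.href, function (err, source) {
//   if (!err) console.log(source);
// });
```

Keep in mind that this is a second request: for dynamic pages the server may return something different from what was originally loaded, and changes made by scripts on the page are not included.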

Answer 4

The page source can simply be downloaded again:

fetch(document.location.href)
    .then(response => response.text())
    .then(pageSource => { /* ... */ })

Answer 5

For IE, you can also use: document.all[0].outerHTML