简体   繁体   中英

Selenium raw page source

I am trying to get the source code of a particular site with the help of Selenium with:

Python code:

driver.page_source 

But it returns it after it has been encoded.

The raw file:

<html>
<head>
    <title>&#x41;&#x41;&#x41;&#x41;&#x41;&#x41;&#x41;&#x41;</title>
</head>
<body>

</body>

When press 'View page source' inside Chrome, I saw the correct source raw without encoding.

How can this be achieved?

You can try using Javascript instead of Python builtin code to get the page source.

javascriptPageSource = driver.execute_script("return document.body.outerHTML;")

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM