用python将某个网站的HTML保存在一个txt文件中

Question

I need save the HTML code of any website in a txt file, is a very easy exercise but I have doubts with this because a have a function that do this:我需要将任何网站的 HTML 代码保存在一个 txt 文件中，这是一个非常简单的练习，但我对此表示怀疑，因为有一个功能可以做到这一点：

import urllib.request

def get_html(url):
    f=open('htmlcode.txt','w')
    page=urllib.request.urlopen(url)
    pagetext=page.read() ## Save the html and later save in the file
    f.write(pagetext)
    f.close()

But this doesn't work.但这不起作用。

Answer 1

Easiest way would be to use urlretrieve :最简单的方法是使用urlretrieve ：

import urllib

urllib.urlretrieve("http://www.example.com/test.html", "test.txt")

For Python 3.x the code is as follows:对于 Python 3.x，代码如下：

import urllib.request    
urllib.request.urlretrieve("http://www.example.com/test.html", "test.txt")

Answer 2

I use Python 3 .我使用Python 3 。
pip install requests - after install requests library you can save a webpage in txt file. pip install requests - 安装requests库后，您可以将网页保存在 txt 文件中。

import requests

url = "https://stackoverflow.com/questions/24297257/save-html-of-some-website-in-a-txt-file-with-python"

r = requests.get(url)
with open('file.txt', 'w') as file:
    file.write(r.text)

用python将某个网站的HTML保存在一个txt文件中

问题描述

2 个解决方案

解决方案1
22 已采纳 2014-06-19 01:18:30

解决方案2
7 2019-09-26 06:47:45

用python将某个网站的HTML保存在一个txt文件中

问题描述

2 个解决方案

解决方案1 22 已采纳 2014-06-19 01:18:30

解决方案2 7 2019-09-26 06:47:45

解决方案1
22 已采纳 2014-06-19 01:18:30

解决方案2
7 2019-09-26 06:47:45