
How to extract data from HTML using Python?

Original 2021-06-07 16:58:15 8 1 python/ web-scraping

How can I extract the data from the HTML of this page?

from urllib.request import urlopen
url = 'http://book.ponniyinselvan.in/part-1/chapter-1.html'
page = urlopen(url)

This raises HTTPError: HTTP Error 403: Forbidden

I am trying to extract the data into a CSV file.
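
A 403 from urlopen is often caused by a server rejecting the default User-Agent that urllib sends. Whether that is the cause for this site is an assumption; the post does not confirm it. A minimal sketch that sends an explicit User-Agent with urllib follows. The helper names build_request and fetch_html and the "Mozilla/5.0" value are illustrative, not from the original post.

```python
from urllib.request import Request, urlopen

URL = "http://book.ponniyinselvan.in/part-1/chapter-1.html"


def build_request(url, user_agent="Mozilla/5.0"):
    """Build a request carrying an explicit User-Agent header.

    The default value is an assumed, browser-like string; the server
    may require something different.
    """
    return Request(url, headers={"User-Agent": user_agent})


def fetch_html(url):
    """Download the page and return its raw bytes."""
    with urlopen(build_request(url), timeout=30) as response:
        return response.read()


# Usage (performs a network request):
#     html = fetch_html(URL)
```

If the server still answers 403 with this header, the block is probably based on something else (for example IP address or cookies), and changing the User-Agent will not help.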

1 answer

You can use this example to save the text to a CSV file:

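import csv
import requests
from bs4 import BeautifulSoup

url = "http://book.ponniyinselvan.in/part-1/chapter-1.html"

# Fetch the page and fail early on an HTTP error status
response = requests.get(url, timeout=30)
response.raise_for_status()

# Take the text of the first <section> element, joining text nodes with newlines
soup = BeautifulSoup(response.content, "html.parser")
text = soup.section.get_text(strip=True, separator="\n")

# newline="" is what the csv module expects; UTF-8 keeps non-ASCII text intact
with open("data.csv", "w", newline="", encoding="utf-8") as f_out:
    writer = csv.writer(f_out)
    writer.writerow(["Chapter", "Text"])
    writer.writerow([1, text])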

This saves the chapter text to data.csv.
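
Because the text is joined with newlines, the "Text" cell of data.csv spans several lines inside one quoted field. A short sketch for reading the file back and checking that follows. It assumes the file is UTF-8 encoded and has the two-column "Chapter", "Text" layout written above; the function name read_chapters is illustrative.

```python
import csv


def read_chapters(path):
    """Return the data rows as (chapter_number, text) tuples, skipping the header."""
    # newline="" lets the csv module handle line breaks inside quoted fields
    with open(path, newline="", encoding="utf-8") as f_in:
        reader = csv.reader(f_in)
        next(reader, None)  # skip the header row
        return [(int(chapter), text) for chapter, text in reader]


# Usage:
#     for chapter, text in read_chapters("data.csv"):
#         print(chapter, len(text.splitlines()), "lines")
```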


