
Using urllib and BeautifulSoup to retrieve info from the web with Python

Original question: 2010-04-15 16:34:29 · Tags: python, web-scraping, beautifulsoup, urllib2

I can fetch an HTML page with urllib and parse HTML with BeautifulSoup, but it looks like I have to write the page to a file for BeautifulSoup to read.

import urllib  # Python 2; in Python 3, urlopen lives in urllib.request

sock = urllib.urlopen("http://SOMEWHERE")
htmlSource = sock.read()
sock.close()
# --> write htmlSource to a file

Is there a way to call BeautifulSoup on the urllib result without writing it to a file first?

2 Answers

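# BeautifulSoup 3 import; with BeautifulSoup 4 it is "from bs4 import BeautifulSoup"
from BeautifulSoup import BeautifulSoup

# htmlSource is the string read from urlopen in the question
soup = BeautifulSoup(htmlSource)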

There is no need to write a file: just pass in the HTML string. You can also pass the object returned by urlopen directly:

f = urllib.urlopen("http://SOMEWHERE")
soup = BeautifulSoup(f)
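
For readers on Python 3, here is a minimal sketch of the same two options. It assumes the `beautifulsoup4` package is installed (imported as `bs4`), which the original answer does not use. The helper names `soup_from_string` and `soup_from_url` are hypothetical, and the `data:` URL is only a stand-in so the snippet runs without network access.

```python
import base64
from urllib.request import urlopen

from bs4 import BeautifulSoup  # assumption: BeautifulSoup 4, not the BS3 import above


def soup_from_string(html):
    """Parse HTML that is already in memory (str or bytes); no file involved."""
    return BeautifulSoup(html, "html.parser")


def soup_from_url(url):
    """Pass the file-like response from urlopen straight to BeautifulSoup."""
    with urlopen(url) as response:
        # BeautifulSoup calls response.read() itself when given a file-like object.
        return BeautifulSoup(response, "html.parser")


# Demo input: a data: URL stands in for "http://SOMEWHERE" (illustrative only).
html = b"<html><head><title>Demo</title></head><body><a href='/a'>A</a></body></html>"
demo_url = "data:text/html;base64," + base64.b64encode(html).decode("ascii")

soup = soup_from_url(demo_url)
print(soup.title.string)
print([a["href"] for a in soup.find_all("a")])
```

Naming the parser explicitly ("html.parser" is the one shipped with Python) avoids the warning BeautifulSoup 4 emits when it has to pick a parser on its own.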

You can open the URL, download the HTML, and parse it in one step with gazpacho:

from gazpacho import Soup
soup = Soup.get("https://www.example.com/")

