簡體 English 中英

在沒有BeautifulSoup的情況下，使用python提取網頁上鏈接的最簡單方法是什么？

[英]What's the easiest way to extract the links on a web page using python without BeautifulSoup?

原文 2010-12-11 00:09:53 3 2 python

我正在使用cygwin，但未安裝BeautifulSoup。

2 個解決方案

使用Python獲取HTML文件上所有<a>標記中href屬性的值

python，regex查找錨鏈接html

正則表達式，用於從HTML鏈接中提取URL

如果您不太在意性能，則可以使用正則表達式：

import re
linkre = re.compile(r"""href=["']([^"']+)["']""")
links = linkre.findall(your_html)

如果只想使用http：//鏈接中的鏈接，則將表達式更改為：

linkre = re.compile(r"""href=["']http:([^"']+)["']""")

或者，如果有可能您的鏈接周圍沒有html，則可以將“”作為可選。

使用python和BeautifulSoup從網頁檢索特定鏈接

[英]retrieve specific links from web page using python and BeautifulSoup

在Web上獲取Python腳本輸出的最簡單方法是什么？

[英]What's easiest way to get Python script output on the web?

使用BeautifulSoup從html頁面提取鏈接

[英]Extract links from html page using BeautifulSoup

使用 BeautifulSoup Python 從網頁中提取特定的 JS 值

[英]Extract specific JS value from web page using BeautifulSoup Python

使用python比較兩個網頁的最簡單方法是什么？

[英]What is the easiest way to compare two web pages using python?

使用BeautifulSoup從網頁檢索鏈接

[英]Retrieve links from web page using BeautifulSoup

無法在dirrectmirror網頁上提取帶有beautifulsoup4的鏈接

[英]cannot extract links with beautifulsoup4 on dirrectmirror web page

使用python從網頁中提取所有鏈接

[英]Extract all links from a web page using python

Python - 使用BeautifulSoup從URL列表中刪除文本的最簡單方法

[英]Python - Easiest way to scrape text from list of URLs using BeautifulSoup

在 Python 中轉義 HTML 的最簡單方法是什么？

[英]What's the easiest way to escape HTML in Python?

暫無

暫無

聲明:本站的技術帖子網頁，遵循CC BY-SA 4.0協議，如果您需要轉載，請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

相關問題 使用python和BeautifulSoup從網頁檢索特定鏈接在Web上獲取Python腳本輸出的最簡單方法是什么？使用BeautifulSoup從html頁面提取鏈接使用 BeautifulSoup Python 從網頁中提取特定的 JS 值使用python比較兩個網頁的最簡單方法是什么？使用BeautifulSoup從網頁檢索鏈接無法在dirrectmirror網頁上提取帶有beautifulsoup4的鏈接使用python從網頁中提取所有鏈接 Python - 使用BeautifulSoup從URL列表中刪除文本的最簡單方法在 Python 中轉義 HTML 的最簡單方法是什么？

相關標簽

粵ICP備18138465號 © 2020-2024 STACKOOM.COM