簡體 English 中英

如何從網站獲取某些鏈接，而不是所有鏈接？

[英]How do I get certain links from a website, but not all of them?

原文 2021-10-26 16:10:07 3 1 python/ beautifulsoup/ python-requests

這是我到目前為止所擁有的：

import requests
from bs4 import BeautifulSoup

def linkScraper():
    html = requests.get("https://www.bbc.com/").text
    soup = BeautifulSoup(html, 'html.parser')
    
    for link in soup.find_all('a'):
        print(link.get('href'))

但這會打印網站上的每個鏈接。 我如何配置它以提供指向出現在 BBC 主頁上的文章的鏈接？

1 個解決方案

您可以使用列表理解對其進行過濾：

links = [link for link in soup.find_all('a') if link.startswith('https://www.bbc.com/')]

我想使用python從某個網頁獲取所有鏈接

[英]I want to get all links from a certain webpage using python

如何使用 pandas 從網站獲取所有表格

[英]How do I get all the tables from a website using pandas

網頁抓取：如何獲取“href”鏈接並從中抓取表格

[英]Web Scraping: How do I get 'href' links and scrape table from them

如何使用meta從網站中的所有鏈接獲取數據

[英]How to use meta to get data from all the links in a website

如何從不斷變化的網站獲取包含短語的所有鏈接

[英]How to get all links containing a phrase from a changing website

如何從網站 Python 中的所有鏈接中提取評論

[英]How can I extract comments from all the links in a website Python

Scrapy 從任何網站獲取所有鏈接

[英]Scrapy get all links from any website

我如何排除某些鏈接被抓取

[英]How do i exclude certain links from being scraped

如何從 python 中的多個網頁獲取所有鏈接？

[英]How do I get all the links from multiple web pages in python?

如何使用python中的Scrapy抓取網站以獲取網站中的所有鏈接？

[英]How to crawl a website to get all the links in a website using Scrapy in python?

暫無

暫無

聲明:本站的技術帖子網頁，遵循CC BY-SA 4.0協議，如果您需要轉載，請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

相關問題 我想使用python從某個網頁獲取所有鏈接如何使用 pandas 從網站獲取所有表格網頁抓取：如何獲取“href”鏈接並從中抓取表格如何使用meta從網站中的所有鏈接獲取數據如何從不斷變化的網站獲取包含短語的所有鏈接如何從網站 Python 中的所有鏈接中提取評論 Scrapy 從任何網站獲取所有鏈接我如何排除某些鏈接被抓取如何從 python 中的多個網頁獲取所有鏈接？如何使用python中的Scrapy抓取網站以獲取網站中的所有鏈接？

相關標簽

粵ICP備18138465號 © 2020-2024 STACKOOM.COM