使用 -beautiful-soup 獲取 python 表列中的 href 鏈接

Question

我有這個數據，你可以看到：

[<td><span></span></td>, <td><span></span></td>, <td><a class="cmc-link" href="/currencies/renbtc/"><span class="circle"></span><span>renBTC</span><span class="crypto-symbol">RENBTC</span></a></td>, <td><span>$<!-- -->61947.68</span></td>, <td><span></span></td>]

我想提取href鏈接，正如您在此處看到的/currencies/renbtc/ 。

這是我的代碼：

from bs4 import BeautifulSoup
import requests
try:
    r = requests.get('https://coinmarketcap.com/')
    soup = BeautifulSoup(r.text, 'lxml')

    
    table = soup.find('table', class_='cmc-table')
    for row in table.tbody.find_all('tr'):    
        # Find all data for each column
         columns = row.find_all('td')
         print(columns)
         
except requests.exceptions.RequestException as e:
    print(e)

但結果是整個列。

Answer 1

迭代列表中的<td> ，如果<td>有<a> （ if td.a ），則.get('href')的td.a ：

from bs4 import BeautifulSoup
import requests
try:
    r = requests.get('https://coinmarketcap.com/')
    soup = BeautifulSoup(r.text, 'lxml')

    table = soup.find('table', class_='cmc-table')

    for row in table.tbody.find_all('tr'):
        # Find all data for each column
        columns = row.find_all('td')
        for td in columns:
            if td.a:
                print(td.a.get('href'))
                # theoretically for performance you can
                # break
                # here to stop the loop if you expect only one anchor link per `td`

except requests.exceptions.RequestException as e:
    print(e)

Answer 2

對包含<a>列中的元素進行操作，選擇它並獲取其href ：

link = columns[2].a['href']

例子

from bs4 import BeautifulSoup
import requests
try:
    r = requests.get('https://coinmarketcap.com/')
    soup = BeautifulSoup(r.text, 'lxml')

    
    table = soup.find('table', class_='cmc-table')
    for row in table.tbody.find_all('tr'):    
        # Find all data for each column
         columns = row.find_all('td')
         link = columns[2].a['href']
         print(link)
         
except requests.exceptions.RequestException as e:
    print(e)

使用 -beautiful-soup 獲取 python 表列中的 href 鏈接

問題描述

2 個解決方案

解決方案1
2 2021-11-01 10:28:23

解決方案2
1 已采納 2021-11-01 10:28:11

例子

使用 -beautiful-soup 獲取 python 表列中的 href 鏈接

問題描述

2 個解決方案

解決方案1 2 2021-11-01 10:28:23

解決方案2 1 已采納 2021-11-01 10:28:11

例子

解決方案1
2 2021-11-01 10:28:23

解決方案2
1 已采納 2021-11-01 10:28:11