簡體   English   中英

抓取標簽屬性 BeautifulSoup

[英]Scraping tag attribute BeautifulSoup

我會從這個頁面抓取所有的data-oid標簽,但在輸出中什么都不返回

代碼

url = 'https://www.betexplorer.com/soccer/south-korea/k-league-2/bucheon-fc-1995-jeonnam/EDwej14E/'

response = requests.get(url)
soup = BeautifulSoup(response.text, 'html.parser')
table = soup.find('table', class_='table-main')

for rows in table.find_all('tr')[1:]:
    for row in rows.find_all('td'):
        data = row.get_attrs['data-oid']
        print(data)

在此處輸入圖片說明

頁面的零件表部分是通過 JavaScript 從外部 URL 加載的。 要獲取帶有data-oid=參數的標簽的data-oid= ,您可以使用以下示例:

import requests
from bs4 import BeautifulSoup

url = "https://www.betexplorer.com/soccer/south-korea/k-league-2/bucheon-fc-1995-jeonnam/EDwej14E/"
match_id = "EDwej14E"  # <-- this is the last part of URL

api_url = "https://www.betexplorer.com/match-odds/{}/1/1x2/".format(match_id)

headers = {"Referer": "https://www.betexplorer.com"}
data = requests.get(api_url, headers=headers).json()
soup = BeautifulSoup(data["odds"], "html.parser")

# your code:
table = soup.find("table", class_="table-main")
for rows in table.find_all("tr")[1:]:
    for row in rows.select("td[data-oid]"):
        data = row["data-oid"]
        print(data)

印刷:


...

4kqjpxv464x0xc6aif
4kqjpxv464x0xc6aie
4kqjpxv498x0x0
4kqjpxv464x0xc6aif
4kqjpxv464x0xc6aie
4kqjpxv498x0x0
4kqjpxv464x0xc6aif

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM