簡體   English   中英

Web 使用 BeautifulSoup 在 Python 3 的頁面上抓取第二個表

[英]Web scraping the second of two tables on a page in Python 3 with BeautifulSoup

我正在研究我的 python 技能,我正在嘗試僅從該頁面https://en.wikipedia.org/wiki/List_of_Wales_national_rugby_union_team_results 刮取“結果”表。 我是 web 抓取的新手,誰能幫我提供一個優雅的解決方案來抓取結果 wikitable? 謝謝!

最簡單的方法是使用 Pandas 加載表:

import pandas as pd

tables = pd.read_html('https://en.wikipedia.org/wiki/List_of_Wales_national_rugby_union_team_results')

# print second table (index 1):
print(tables[1])

印刷:

                 Date                 Venue Home team   Away team       Score                     Competition    Winner Match report
0        7 March 2020    Twickenham Stadium   England       Wales       33–30                2020 Six Nations   England          BBC
1    22 February 2020  Principality Stadium     Wales      France       23–27                2020 Six Nations    France          BBC
2     8 February 2020         Aviva Stadium   Ireland       Wales       24–14                2020 Six Nations   Ireland          BBC
3     1 February 2020  Principality Stadium     Wales       Italy        42–0                2020 Six Nations     Wales          BBC
4    30 November 2019  Principality Stadium     Wales  Barbarians       43–33                      Tour Match     Wales          BBC
..                ...                   ...       ...         ...         ...                             ...       ...          ...
741    5 January 1884       Cardigan Fields   England       Wales    1G 2T–1G  1884 Home Nations Championship   England          NaN
742    8 January 1883         Raeburn Place  Scotland       Wales       3G–1G  1883 Home Nations Championship  Scotland          NaN
743  16 December 1882            St Helen's     Wales     England     0–2G 4T  1883 Home Nations Championship   England          NaN
744   28 January 1882        Lansdowne Road   Ireland       Wales     0–2G 2T                             NaN     Wales          NaN
745  19 February 1881    Richardson's Field   England       Wales  7G 6T 1D–0                             NaN   England          NaN

[746 rows x 8 columns]

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM