
Scraping a table with Beautiful Soup and Python

I want to retrieve the tr elements from the nested table #timeTable on this web page.

I have tried the following, but it gives an empty list.

import urllib.request
from bs4 import BeautifulSoup

nlg_timetable_url = "https://navlib.forth-crs.gr/italian_b2c/npgres.exe?func=TT&ReservationType=npgres.exe%3FPM%3DBO&Leg1i=PRJ&Leg1ii=BEV&Leg1Date=26%2F02%2F2019&TotalPassengers=1&TotalPassengersHuman=1&TotalPassengersAcce=0&TotalVehicles=0"
headers = {'user-agent':'Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/62.0.3202.94 Safari/537.3'}
request = urllib.request.Request(nlg_timetable_url, headers=headers)
html = urllib.request.urlopen(request).read()
soup = BeautifulSoup(html, 'html.parser')
ngl_timetable_table = list(soup.select('#timeTable tr'))
print(ngl_timetable_table)

Output

[]

I would use the requests module:

import requests
from bs4 import BeautifulSoup

nlg_timetable_url = "https://navlib.forth-crs.gr/italian_b2c/npgres.exe?func=TT&ReservationType=npgres.exe%3FPM%3DBO&Leg1i=PRJ&Leg1ii=BEV&Leg1Date=26%2F02%2F2019&TotalPassengers=1&TotalPassengersHuman=1&TotalPassengersAcce=0&TotalVehicles=0"
headers = {'user-agent':'Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/62.0.3202.94 Safari/537.3'}

# Fetch the page with a browser-like user agent and parse the raw bytes
res = requests.get(nlg_timetable_url, headers=headers)
soup = BeautifulSoup(res.content, 'html.parser')

# Print the text of every row inside the element with id="timeTable"
for item in soup.select('#timeTable tr'):
    print(item.text)
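
If the rows are needed as structured data rather than printed text, the same selector can feed a small helper. The following is only a sketch: SAMPLE_HTML is a made-up stand-in for the downloaded page (the real markup of the timetable page is not verified here), and table_rows is a hypothetical helper name, not something provided by Beautiful Soup.

```python
from bs4 import BeautifulSoup

# Hypothetical stand-in for the downloaded page: an outer layout table
# wrapping a nested table with id="timeTable". The real page may differ.
SAMPLE_HTML = """
<table><tr><td>
  <table id="timeTable">
    <tr><th>Departure</th><th>Arrival</th></tr>
    <tr><td> 08:00 </td><td> 12:30 </td></tr>
    <tr><td> 15:00 </td><td> 19:30 </td></tr>
  </table>
</td></tr></table>
"""


def table_rows(html, selector="#timeTable tr"):
    """Return each row matched by `selector` as a list of stripped cell strings."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    for tr in soup.select(selector):
        # Collect header and data cells in document order.
        cells = [cell.get_text(strip=True) for cell in tr.find_all(["th", "td"])]
        if cells:  # skip rows that contain no cells
            rows.append(cells)
    return rows


if __name__ == "__main__":
    for row in table_rows(SAMPLE_HTML):
        print(row)
```

With a live response, table_rows(res.content) could be called in place of the loop above. An empty result would suggest that the fetched HTML does not contain #timeTable at all, which is worth checking before changing the selector.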

