由于一些奇怪的字符串，BeautifulSoup 找不到我的桌子

Question

Hi I have been trying to get a table from wunderground using BeautifulSoup but it just doesn't work.嗨，我一直在尝试使用 BeautifulSoup 从 wunderground 获取一张桌子，但它不起作用。

I think it could be for the starnge string next to the table header but i can´t fix it.我认为它可能是表格标题旁边的星形字符串，但我无法修复它。

Here is my code:这是我的代码：

from bs4 import BeautifulSoup
import requests

url='https://www.wunderground.com/history/daily/LEMD/date/2020-10-21'

html_content = requests.get(url).text

soup = BeautifulSoup(html_content, "html.parser")

table = soup.find("table", {"class": "mat-table cdk-table mat-sort ng-star-inserted"})

table_data = table.tbody.find_all("tr")

and the error:和错误：

Traceback (most recent call last):
  File "weather_poc.py", line 12, in <module>
    table_data = table.tbody.find_all("tr")
AttributeError: 'NoneType' object has no attribute 'tbody'

Answer 1

The data you see is loaded from external URL via JavaScript.您看到的数据是通过 JavaScript 从外部 URL 加载的。 You can use requests / json module to load it.您可以使用requests / json模块来加载它。 For example:例如：

import json
import requests
import pandas as pd


url = 'https://api.weather.com/v1/location/LEMD:9:ES/observations/historical.json?apiKey=6532d6454b8aa370768e63d6ba5a832e&units=e&startDate=20201021&endDate=20201021'
data = requests.get(url).json()

# uncomment this line to print all data:
# print(json.dumps(data, indent=4))

df = pd.json_normalize(data['observations'])
df.to_csv('data.csv', index=False)

Creates data.csv (screenshot from LibreOffice):创建data.csv （来自 LibreOffice 的截图）：

由于一些奇怪的字符串，BeautifulSoup 找不到我的桌子

问题描述

1 个解决方案

解决方案1
1 已采纳 2020-10-22 15:41:38

由于一些奇怪的字符串，BeautifulSoup 找不到我的桌子

问题描述

1 个解决方案

解决方案1 1 已采纳 2020-10-22 15:41:38

解决方案1
1 已采纳 2020-10-22 15:41:38