使用 python 抓取 web 頁面

Question

這是代碼，它產生我想要的東西，但不是我想要的 output 結果

   import requests
    from bs4 import BeautifulSoup
    url = 'https://en.wikipedia.org/wiki/2020_coronavirus_pandemic_in_Florida'

    fl = requests.get(url)
    fl_soup = BeautifulSoup(fl.text, 'html.parser')
    block = fl_soup.findAll('td', {'class': 'bb-04em'})

    for name in fl_soup.findAll('td', {'class': 'bb-04em'}):
        print(name.text)

output

2020-04-21

27,869(+3.0%)

867

我想要這樣的 output 2020-04-21 27,869(+3.0%) 867

Answer 1

以下應該做你想要的：

import requests
from bs4 import BeautifulSoup
url = 'https://en.wikipedia.org/wiki/2020_coronavirus_pandemic_in_Florida'

fl = requests.get(url)
fl_soup = BeautifulSoup(fl.text, 'html.parser')

div_with_table = fl_soup.find('div', {'class': 'barbox tright'})
table = div_with_table.find('table')

for row in table.findAll('tr'):
    for cell in row.findAll('td', {'class': 'bb-04em'}):
        print(cell.text, end=' ')
    print()  # new line for each row

Answer 2

在訪問每個<td>之前，嘗試通過每個<tr>獲取數據，您將獲得每個表行的信息。 然后你可以在<td>內搜索或任何你想要的。

Answer 3

對於最后一個打印語句，包括 end 參數。 默認情況下，打印語句有 end='\n'

print(name.text, end=' ')

這將為您提供所需的 output。

使用 python 抓取 web 頁面

問題描述

3 個解決方案

解決方案1
0 2020-04-22 14:14:00

解決方案2
0 2020-04-22 15:25:22

解決方案3
0 2020-04-22 21:34:17

使用 python 抓取 web 頁面

問題描述

3 個解決方案

解決方案1 0 2020-04-22 14:14:00

解決方案2 0 2020-04-22 15:25:22

解決方案3 0 2020-04-22 21:34:17

解決方案1
0 2020-04-22 14:14:00

解決方案2
0 2020-04-22 15:25:22

解決方案3
0 2020-04-22 21:34:17