How do you loop the URL output for BeautifulSoup?

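I want to extract each player's block of data, with a blank line between the blocks. I have set up the loop, but when I go to print the result, it only extracts the second player profile. Any idea how to fix this?

If it worked correctly, the output would be: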

Greg Oden C  #20
Born: Jan 22, 1988 (33 years old)
Birthplace/Hometown: Buffalo, New York
Nationality: United States
Height: 7-0 (213cm)     Weight: 273 (124kg)
Website: http://www.gregoden52.com/
Current NBA Status: Unrestricted Free Agent
Agent: Bill Duffy
Draft Entry: 2007 NBA Draft
Early Entry Info: 2007 Early Entrant
Drafted: Round 1, Pick 1, Portland Trail Blazers
Pre-Draft Team: Ohio State (Fr)
High School: Lawrence North High School [Indianapolis, Indiana]
AAU Team: Spiece Indy Heat

Carl Landry F
Current Team: N/A
Born: Sep 19, 1983 (37 years old)
Birthplace/Hometown: Milwaukee, Wisconsin  
Nationality: United States
Height: 6-9 (206cm)     Weight: 248 (112kg)
Hand: Right
Website: https://carllandry.com/
@CarlLandry
Current NBA Status: Unrestricted Free Agent
Agent: Mark Bartelstein, Reggie Brown
Draft Entry: 2007 NBA Draft
Drafted: Round 2, Pick 1, Seattle SuperSonics
Draft Rights Trade: SEA to HOU, Jun 28, 2007
Pre-Draft Team: Purdue (Sr)
High School: Vincent High School [Milwaukee, Wisconsin]

Here is the code:

import csv
import requests
from bs4 import BeautifulSoup
import re

url_list = ['https://basketball.realgm.com/player/player/Summary/1',
            'https://basketball.realgm.com/player/player/Summary/2']

for url in url_list:
    r = requests.get(url)
    soup = BeautifulSoup(r.text, 'html.parser')

player = soup.find_all('div', class_= 'wrapper clearfix container')[0]


playerprofile = re.sub(r'\n\s*\n', r'\n', player.get_text().strip(), flags=re.M)

print(playerprofile)

import csv
import requests
from bs4 import BeautifulSoup
import re

url_list = ['https://basketball.realgm.com/player/player/Summary/1',
            'https://basketball.realgm.com/player/player/Summary/2']

for url in url_list:
    r = requests.get(url)
    soup = BeautifulSoup(r.text, 'html.parser')

    player = soup.find_all('div', class_='wrapper clearfix container')[0]

    playerprofile = re.sub(
        r'\n\s*\n', r'\n', player.get_text().strip(), flags=re.M)

    print(playerprofile + "\n")

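The code directly above produces the output you want. In your version, the parsing and printing of the player happen after the loop has finished, so by then soup only holds the page from the last URL and only the second profile is printed. That work has to be done on every iteration of the loop, so indent those lines into the loop body.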
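
To see the effect of the indentation without touching the network, here is a minimal sketch. The fake_pages dictionary and the collapse_blank_lines helper are hypothetical stand-ins (not part of the original code) for the downloaded page text and the re.sub call. It also shows one possible variation: collecting the profiles in a list and joining them with a blank line, rather than printing inside the loop.

```python
import re

# Hypothetical stand-ins for the text extracted from each downloaded page.
fake_pages = {
    "Summary/1": "Greg Oden C\n\n\nBorn: Jan 22, 1988",
    "Summary/2": "Carl Landry F\n\n   \nBorn: Sep 19, 1983",
}


def collapse_blank_lines(text):
    """Same regex as in the question: squeeze runs of blank lines into one newline."""
    return re.sub(r'\n\s*\n', r'\n', text.strip(), flags=re.M)


# Shape of the original code: only the assignment is inside the loop,
# so `text` is overwritten on each pass and keeps just the last page.
for key in fake_pages:
    text = fake_pages[key]
after_loop = [collapse_blank_lines(text)]

# Shape of the fix: the processing is indented into the loop body,
# so it runs once per page.
profiles = []
for key in fake_pages:
    text = fake_pages[key]
    profiles.append(collapse_blank_lines(text))

# Join the profiles with one blank line between them.
output = "\n\n".join(profiles)
print(output)
```

With real pages, the body of the second loop would hold the requests.get, BeautifulSoup, and find_all calls from the answer; the structure of the loop stays the same.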

