我想web爬取，但是有的項目爬到了，有的項目沒有爬到。我不知道原因

Question

我在 python 中使用 BeautifulSoup 來抓取網站。

addrs , a_earths被爬取時，末尾points = soup.select('.addr_point') This section 無法被爬取。 我不知道原因（網頁圖像中的紅色虛線框）

以下是我正在使用的代碼塊：

import urllib.parse
from bs4 import BeautifulSoup
import re

url = 'http://www.dooinauction.com/auction/ca_list.php'

req = urllib.request.Request(url) #
html = urllib.request.urlopen(req).read()
soup = BeautifulSoup(html, 'html.parser') 

tots = soup.select('div.title_left font') #total
tot = int(re.findall('\d+', tots[0].text)[0]) 
print(f'total : {tot}건')

url = f'http://www.dooinauction.com/auction/ca_list.php?total_record={tot}&search_fm_off=1&search_fm_off=1&start=0'
html = urllib.request.urlopen(url).read()
soup = BeautifulSoup(html, 'html.parser')

addrs = soup.select('.addr')  # crawling OK
a_earths = soup.select('.list_class.bold') #crawling OK
points = soup.select('.addr_point') #crawling NO
print()

網頁圖片

Answer 1

我瀏覽了您的網站，但似乎看不到 addr_points 部分。 我想也許這就是原因。

截屏：

我想web爬取，但是有的項目爬到了，有的項目沒有爬到。我不知道原因

問題描述

1 個解決方案

解決方案1
0 2020-02-24 09:05:36

我想web爬取，但是有的項目爬到了，有的項目沒有爬到。 我不知道原因

問題描述

1 個解決方案

解決方案1 0 2020-02-24 09:05:36

我想web爬取，但是有的項目爬到了，有的項目沒有爬到。我不知道原因

解決方案1
0 2020-02-24 09:05:36