使用 python 和 bs4 抓取后的不同数据

Question

I'm trying to get the number of reviews on Amazon.我正在尝试获取亚马逊上的评论数量。 However, when I take the data it is different from that on the site.但是，当我获取数据时，它与网站上的数据不同。 (131 is after scraping and 655 from Amazon) I attach screenshots of the page and the one after the scraping. （131 是在抓取之后，655 来自亚马逊）我附上页面截图和抓取之后的截图。

131 reviews 131 条评论

655 reviews 655 条评论

From inspect element从检查元素

import bs4
import requests
import time


url3 = "https://www.amazon.it/dp/B076S8NSCD"

headers = {"User-Agent" : 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_13_6) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/13.0.5 Safari/605.1.15'}

res = requests.get(url3, headers = headers)

soup = bs4.BeautifulSoup(res.text, "html.parser")


reviews = soup.find(id = "acrCustomerReviewText").get_text()
print(reviews)

Answer 1

If you aren't using premium rotating residential proxies to scrape Amazon reviews there's a good chance this is a cloaking measure and your IP is flagged for sending too many requests.如果您没有使用高级轮换住宅代理来抓取亚马逊评论，那么这很可能是一种伪装措施，您的 IP 被标记为发送过多请求。

使用 python 和 bs4 抓取后的不同数据

问题描述

1 个解决方案

解决方案1
1 2022-10-23 11:34:46

使用 python 和 bs4 抓取后的不同数据

问题描述

1 个解决方案

解决方案1 1 2022-10-23 11:34:46

解决方案1
1 2022-10-23 11:34:46