
Different data after scraping with Python and bs4

I'm trying to get the number of reviews for a product on Amazon. However, the value I scrape is different from the one shown on the site: I get 131 after scraping, while Amazon shows 655. I attach screenshots of the live page and of the scraped result.

131 reviews

655 reviews

From inspect element

import bs4
import requests

# Amazon.it product page whose review count is being scraped
url3 = "https://www.amazon.it/dp/B076S8NSCD"

# Send a browser-like User-Agent with the request
headers = {"User-Agent": "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_13_6) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/13.0.5 Safari/605.1.15"}

res = requests.get(url3, headers=headers)

soup = bs4.BeautifulSoup(res.text, "html.parser")

# Element that holds the review count text on the product page
reviews = soup.find(id="acrCustomerReviewText").get_text()
print(reviews)

If you aren't using premium rotating residential proxies to scrape Amazon reviews, there's a good chance this is a cloaking measure: your IP has been flagged for sending too many requests, so the page served to your script differs from the one you see in the browser.
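One way to check whether this is what is happening is to look at what the server actually returned before parsing it. The sketch below is only a heuristic under stated assumptions: `diagnose_response` is a hypothetical helper name, and the idea that a bot-check page mentions "captcha" in its markup is an assumption, not documented Amazon behavior. The element id is the one used in the question's code.

```python
def diagnose_response(status_code, html):
    """Return a list of reasons the fetched page may not be the real product page.

    Heuristic sketch only: the "captcha" marker is an assumption about what
    a bot-check page looks like, not documented behavior.
    """
    problems = []

    # Anything other than 200 suggests the request was throttled or refused.
    if status_code != 200:
        problems.append(f"unexpected HTTP status {status_code}")

    # Assumption: a bot-check page mentions "captcha" somewhere in its markup.
    if "captcha" in html.lower():
        problems.append("page mentions a CAPTCHA")

    # The id below is the one the question's code looks up.
    if "acrCustomerReviewText" not in html:
        problems.append("review-count element is missing")

    return problems
```

With the question's code this could be called as `diagnose_response(res.status_code, res.text)`. An empty list does not prove the page matches what the browser shows; writing `res.text` to a file and opening it is a further check worth doing.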

