AttributeError in Scrapy web scraping
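I wrote a Scrapy spider to scrape a website, but it fails with an attribute error. I am new to web scraping, so please guide me on how to fix it. This is the error message: AttributeError: 'str' object has no attribute 'xpath'
Here is my code: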
# -*- coding: utf-8 -*-
import scrapy


class ShopSpider(scrapy.Spider):
    name = 'shop'
    allowed_domains = ['https://www.redbubble.com']
    start_urls = ['https://www.redbubble.com/shop/shower-curtains/']

    def parse(self, response):
        products = response.xpath("//a[@class='styles__link--2sYi3']").get()
        for product in products:
            product_url = product.xpath(".//img[@class='styles__image--2CwxX styles__productImage--3ZNPD styles__rounded--1lyoH styles__fluid--3dxe-']/@src").get()
            title = name = product.xpath(".//div[@class='styles__box--206r9 styles__paddingRight-0--fzRHs']/div[@class='styles__textContainer--1xehi styles__disableLineHeight--3n9Fg styles__nowrap--2Vk3A']/span/text()").get()
            yield {
                'name' : title,
                'url' : product_url
            }
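The error is clear: you are calling the xpath method on a string. The .get() call returns the first match serialized as a str, so products is a string, and the for loop then iterates over its individual characters, none of which has an xpath method.
Replace
products = response.xpath("//a[@class='styles__link--2sYi3']").get()
with
products = response.xpath("//a[@class='styles__link--2sYi3']")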
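Here is the code that worked for me. You get the str error because .get() turns the result into a string, and you cannot call .xpath on a string. Iterate over the result of response.xpath directly in the for loop instead. You can also remove allowed_domains (it expects bare domain names such as redbubble.com, not full URLs).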
import scrapy


class ShopSpider(scrapy.Spider):
    name = 'shop'
    start_urls = ['https://www.redbubble.com/shop/shower-curtains/']

    def parse(self, response):
        for product in response.xpath("//a[@class='styles__link--2sYi3']"):
            product_url = product.xpath(
                ".//img[@class='styles__image--2CwxX styles__productImage--3ZNPD styles__rounded--1lyoH styles__fluid--3dxe-']/@src").get()
            title = product.xpath(".//div[@class='styles__box--206r9 styles__paddingRight-0--fzRHs']/div[@class='styles__textContainer--1xehi styles__disableLineHeight--3n9Fg styles__nowrap--2Vk3A']/span/text()").get()
            yield {
                "title": title,
                "url": product_url
            }