scrapy無法進行Request（）回調

Question

我試圖用Scrapy制作遞歸解析腳本，但是Request()函數不調用回調函數suppose_to_parse() ，也不調用回調值中提供的任何函數。 我嘗試了不同的變化，但沒有一個工作。 在哪里挖？

from scrapy.http import Request
from scrapy.spider import BaseSpider
from scrapy.selector import HtmlXPathSelector



class joomler(BaseSpider):
    name = "scrapy"
    allowed_domains = ["scrapy.org"]
    start_urls = ["http://blog.scrapy.org/"]


    def parse(self, response):
        print "Working... "+response.url
        hxs = HtmlXPathSelector(response)
        for link in hxs.select('//a/@href').extract():
            if not link.startswith('http://') and not link.startswith('#'):
               url=""
               url=(self.start_urls[0]+link).replace('//','/')
               print url
               yield Request(url, callback=self.suppose_to_parse)


    def suppose_to_parse(self, response):
        print "asdasd"
        print response.url

Answer 1

我不是專家，但我嘗試了你的代碼，我認為問題不在請求上，生成的url似乎被打破了，如果你將一些url添加到列表並迭代它們並產生帶回調的Request ，它工作正常。

Answer 2

將yield移到if語句之外：

for link in hxs.select('//a/@href').extract():
    url = link
    if not link.startswith('http://') and not link.startswith('#'):
        url = (self.start_urls[0] + link).replace('//','/')

    print url
    yield Request(url, callback=self.suppose_to_parse)

scrapy無法進行Request（）回調

問題描述

2 個解決方案

解決方案1
1 2013-03-22 21:56:17

解決方案2
1 已采納 2013-03-22 22:28:35

scrapy無法進行Request（）回調

問題描述

2 個解決方案

解決方案1 1 2013-03-22 21:56:17

解決方案2 1 已采納 2013-03-22 22:28:35

解決方案1
1 2013-03-22 21:56:17

解決方案2
1 已采納 2013-03-22 22:28:35