
Is it possible to crawl multiple start_urls lists simultaneously?

I have 3 URL files. All of them have the same structure, so the same spider can be used for all of the lists. A special requirement is that all three need to be crawled simultaneously.

Is it possible to crawl them simultaneously without creating multiple spiders?

I believe this answer

start_urls = ["http://example.com/category/top/page-%d/" % i for i in xrange(4)] + \
["http://example.com/superurl/top/page-%d/" % i for i in xrange(55)]

from Scrap multiple urls with scrapy only joins the two lists, but does not run them at the same time.

Thanks very much.

Use start_requests instead of start_urls; this will work for you:

import scrapy


class MySpider(scrapy.Spider):
    name = 'myspider'

    def start_requests(self):
        for page in range(1, 20):
            # make_requests_from_url is deprecated/removed in newer Scrapy,
            # so build the Request directly.
            yield scrapy.Request('https://www.example.com/page-%s' % page)
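Applied to the original question (three URL files, one spider), a minimal sketch might read each file in start_requests and yield a request per line; the file names and spider name below are assumptions, and Scrapy's scheduler then fetches all of the requests concurrently (up to CONCURRENT_REQUESTS), regardless of which list they came from:

import scrapy


class MultiListSpider(scrapy.Spider):
    name = 'multilist'
    # Hypothetical file names -- substitute your own three URL files.
    url_files = ['list1.txt', 'list2.txt', 'list3.txt']

    def start_requests(self):
        # Yield a request for every non-empty line in every file.
        for path in self.url_files:
            with open(path) as f:
                for line in f:
                    url = line.strip()
                    if url:
                        yield scrapy.Request(url, callback=self.parse)

    def parse(self, response):
        # One parse method serves all three lists, since the pages share a structure.
        self.logger.info('Crawled %s', response.url)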
