Cannot concatenate 'str' and 'NoneType' objects using Scrapy

Question

I am trying to scrape information from this website https://www.gumtree.co.za , however I am not sure how to get the URL of the property.

This is what I have:

class GumtreeSpider(scrapy.Spider):
    name = "gumtree"
    start_urls = ['https://www.gumtree.co.za/s-house-rentals-flat-rentals-offered/cape-town/v1c9071l3100006p1',
                  'https://www.gumtree.co.za/s-houses-flats-for-sale/cape-town/v1c9074l3100006p1']

    def parse(self, response):

        for prop in response.css('div.tileV1'):
            link = 'https://www.gumtree.co.za' + prop.css('div.title  a.tile-title-text::attr(href)').get()

I have tried multiple combinations however I cannot seem to get it right. Any suggestions? Thank you!

Answer 1

What are you trying to scrape exactly in the page? The ad list in the carousel in the top?

If yes, you need to look at the Javascript XHR requests, they are loaded dynamically so you have to inspect the different requests to find what you want.

Cannot concatenate 'str' and 'NoneType' objects using Scrapy

Question

1 answers

solution1
0 2019-10-06 21:39:01

Cannot concatenate 'str' and 'NoneType' objects using Scrapy

Question

1 answers

solution1 0 2019-10-06 21:39:01

solution1
0 2019-10-06 21:39:01