I'm using Scrapy to get the contents of some webpages. Is there a way to configure Scrapy so that it exports each data line into a separate file?
You can yield items in your spider to return multiple items to be processed in your pipeline.
class SomeSpider(Spider):
    ...

    def parse(self, response):
        # some code to parse the webpage
        for some_line in webpage:
            item = YourItem()
            # parse items
            yield item
This will return multiple items for one scraped page. Then configure your pipeline to write each item to a separate file.
class SomePipeline(object):

    def open_spider(self, spider):
        self.count = 0

    def process_item(self, item, spider):
        # use a unique filename per item; a fixed name opened in 'w'
        # mode would be overwritten on every call
        self.count += 1
        with open('item_%d.txt' % self.count, 'w') as f:
            # format your item into a line here
            f.write(line)
        return item