Make scrapy export to csv

I want to use Scrapy in the following way:

from scrapy.crawler import CrawlerProcess

process = CrawlerProcess({
    'USER_AGENT': 'Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 5.1)' })

process.crawl(my_super_scraper) 
process.start()

It works with my_super_scraper, but I cannot figure out how to export the scraped items to CSV. I cannot find it in the documentation either.

You need to set the FEED_FORMAT and FEED_URI settings as follows:

from scrapy.crawler import CrawlerProcess

process = CrawlerProcess({
    'USER_AGENT': 'Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 5.1)', 
    'FEED_FORMAT': 'csv',
    'FEED_URI': 'file:///tmp/export.csv',
})

process.crawl(my_super_scraper)
process.start()

More information about feed exports is available here: https://docs.scrapy.org/en/latest/topics/feed-exports.html
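
On newer Scrapy releases the same export can be configured through the FEEDS setting instead. The following is a minimal sketch, assuming Scrapy 2.1 or later, where FEEDS supersedes FEED_FORMAT and FEED_URI. The helper names build_settings and run_spider and the export.csv path are hypothetical, chosen only for illustration.

```python
def build_settings(output_path="export.csv"):
    """Return a settings dict that writes scraped items to a CSV file.

    Assumes Scrapy >= 2.1, where FEEDS maps each output URI/path
    to a dict of per-feed options.
    """
    return {
        "USER_AGENT": "Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 5.1)",
        "FEEDS": {
            output_path: {"format": "csv"},
        },
    }


def run_spider(spider_cls, output_path="export.csv"):
    """Run spider_cls in-process and export its items to output_path."""
    # Imported here so build_settings can be inspected without Scrapy installed.
    from scrapy.crawler import CrawlerProcess

    process = CrawlerProcess(build_settings(output_path))
    process.crawl(spider_cls)
    process.start()  # blocks until the crawl finishes
```

Calling run_spider(my_super_scraper) would then be the equivalent of the snippet above, with the output path passed as an argument rather than hard-coded.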

A related post shows how to export to JSON format: "Scrapy process.crawl() to export data to json".
