Export csv file from scrapy (not via command line)
Problem description
I successfully tried to export my items into a csv file from the command line, like:
scrapy crawl spiderName -o filename.csv
My question is: what is the easiest solution to do the same in code? I need this because I extract the filename from another file. The end scenario should be that I call
scrapy crawl spiderName
and it writes the items into filename.csv.
Answer
Why not use an item pipeline?
WriteToCsv.py
import csv

from YOUR_PROJECT_NAME_HERE import settings


def write_to_csv(item):
    # Append one row per item; a context manager ensures the file is closed.
    with open(settings.csv_file_path, 'a') as f:
        writer = csv.writer(f, lineterminator='\n')
        writer.writerow([item[key] for key in item.keys()])


class WriteToCsv(object):
    def process_item(self, item, spider):
        write_to_csv(item)
        return item
settings.py
ITEM_PIPELINES = { 'project.pipelines_path.WriteToCsv.WriteToCsv' : A_NUMBER_HIGHER_THAN_ALL_OTHER_PIPELINES}
csv_file_path = PATH_TO_CSV
If you wanted items to be written to a separate csv for each spider, you could give your spider a CSV_PATH field. Then in your pipeline, use your spider's field instead of the path from settings.
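A minimal sketch of that per-spider variant. The `CSV_PATH` attribute name comes from the answer; the fallback to the project-wide `csv_file_path` setting is an assumption added here for illustration:

```python
import csv


class WriteToCsv(object):
    def process_item(self, item, spider):
        # Prefer the spider's own CSV_PATH attribute; fall back to the
        # project-wide csv_file_path setting if the spider has none.
        path = getattr(spider, 'CSV_PATH', None) or spider.settings.get('csv_file_path')
        with open(path, 'a') as f:
            writer = csv.writer(f, lineterminator='\n')
            writer.writerow([item[key] for key in item.keys()])
        return item
```

Each spider that defines its own `CSV_PATH` then gets its own output file, while spiders without one share the path configured in settings.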
This works; I tested it in my project.
HTH
http://doc.scrapy.org/en/latest/topics/item-pipeline.html