繁体 English 中英

我如何让Scrapy爬进python项目？

[英]How can I get have Scrapy crawling inside a python project ?

原文 2016-03-20 20:25:45 7 1 python/ scrapy

我有一个个人项目，导致我使用Selenium以便从一对私人[邮件，密码]对获得公共URL地址。

我想在此URL上保存信息，然后按照Scrapy教程获得如何使用此工具进行操作。 但是，有没有一种方法可以在MyScrapClass.crawl()这样的Python项目中启动爬网，而不是使用linux命令scrapy crawl MyScrapProject ？

1 个解决方案

使用CrawlerProcess或CrawlerRunner类从python脚本中运行scrapy。

http://doc.scrapy.org/en/latest/topics/practices.html

例子来自于scrapy网站：

import scrapy
from scrapy.crawler import CrawlerProcess

class MySpider(scrapy.Spider):
    # Your spider definition
    ...

process = CrawlerProcess()
process.crawl(MySpider)
# the script will block here until the crawling is finished
process.start()

在Costom python脚本中从scrapy抓取网站后，如何获取网址列表？

[英]How we can get List of urls after crawling website from scrapy in costom python script?

如何在 Python Scrapy 上获取短信

[英]How can I get text on Python Scrapy

scrapy如何从scrapy项目中获取项目名称

[英]scrapy how to get project name from inside a scrapy project

我该如何解决这个错误？（python 爬行）

[英]How can I solve this error?(python crawling)

Python-Scrapy爬网

[英]Python - Scrapy crawling the web

抓痒的python CrawlSpider无法爬行

[英]scrapy python CrawlSpider not crawling

使用Python和Scrapy进行递归爬行

[英]recursive crawling with Python and Scrapy

我怎样才能在这个页面上爬行？我有一个特定的错误

[英]How can I crawling on this page? I have a specific error

如何在 python scrapy 中仅获取文本

[英]How can I get only text in python scrapy

在python中，如何获取返回隐藏元素的内容？

[英]In python, how can I get scrapy to return the contents of an element that is hidden?

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 在Costom python脚本中从scrapy抓取网站后，如何获取网址列表？如何在 Python Scrapy 上获取短信 scrapy如何从scrapy项目中获取项目名称我该如何解决这个错误？（python 爬行） Python-Scrapy爬网抓痒的python CrawlSpider无法爬行使用Python和Scrapy进行递归爬行我怎样才能在这个页面上爬行？我有一个特定的错误如何在 python scrapy 中仅获取文本在python中，如何获取返回隐藏元素的内容？

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM