jmg / crawley
Pythonic Crawling / Scraping Framework based on Non Blocking I/O operations.
☆189Updated 2 years ago
Alternatives and similar repositories for crawley
Users that are interested in crawley are comparing it to the libraries listed below
Sorting:
- Collection of Scrapy utilities (extensions, middlewares, pipelines, etc)☆32Updated 7 years ago
- Useful test spiders for Scrapy☆185Updated 5 years ago
- Web Crawling UI and HTTP API, based on Scrapy and Tornado☆162Updated 2 years ago
- Scrapy extension to control spiders using JSON-RPC☆300Updated 5 years ago
- Python library of web-related functions☆406Updated last week
- PyQuery-based scraping micro-framework.☆116Updated 3 years ago
- PhantomJS Downloader for Scrapy, Yeah!☆94Updated 10 years ago
- Sourcecode for the bf3 developer news aggregator.☆84Updated 13 years ago
- Output scrapy statistics to graphite/carbon☆54Updated 12 years ago
- A scrapy pipeline which send items to Elastic Search server☆98Updated 7 years ago
- Mongodb support for scrapy☆101Updated 8 years ago
- A Python web crawler using Tornado and ZeroMQ☆141Updated 13 years ago
- Tornado App Engine Blog☆98Updated 8 years ago
- Celery integration for Flask (SINCE CELERY 3.0 THIS IS NO LONGER NEEDED)☆200Updated 11 years ago
- Brownant is a web data extracting framework.☆159Updated 8 years ago
- ☆143Updated 9 years ago
- Scrapy project based on dirbot to show how to use Twisted's adbapi to store the scraped data in MySQL.☆117Updated 11 years ago
- MongoDB python logging handler. Python centralized logging made easy.☆198Updated 12 years ago
- A RabbitMQ Scheduler for Scrapy☆87Updated 2 years ago
- A Python Library for Simple Models and Containers Persisted in Redis☆300Updated 9 years ago
- A Python wrapper for working with Scrapyd's API.☆271Updated 9 months ago
- A Redis client library for Twisted Python☆128Updated 3 years ago
- Redis/Memcached sessions for Tornado☆97Updated 4 years ago
- Command line webpage screenshot and thubnail generator☆191Updated 3 years ago
- A Django based search engine powered by CouchDB, celery and whoosh.☆49Updated 9 years ago
- MongoDB pipeline for Scrapy. This module supports both MongoDB in standalone setups and replica sets. scrapy-mongodb will insert the item…☆357Updated 4 years ago
- asynchronous python driver for mongo☆338Updated last month
- [abandoned] python port of arc90's readability bookmarklet☆540Updated 13 years ago
- Requests + futures = <3 - a grequests fork, origonal code: https://github.com/kennethreitz/grequests☆63Updated 9 years ago
- 分布式定向抓取集群☆71Updated 7 years ago