jmg / crawley
Pythonic Crawling / Scraping Framework based on Non Blocking I/O operations.
☆186Updated last year
Related projects ⓘ
Alternatives and complementary repositories for crawley
- Collection of Scrapy utilities (extensions, middlewares, pipelines, etc)☆31Updated 6 years ago
- Python library of web-related functions☆392Updated 3 weeks ago
- Web Crawling UI and HTTP API, based on Scrapy and Tornado☆161Updated 2 years ago
- MongoDB python logging handler. Python centralized logging made easy.☆198Updated 11 years ago
- ☆143Updated 8 years ago
- A scrapy pipeline which send items to Elastic Search server☆98Updated 6 years ago
- Celery integration for Flask (SINCE CELERY 3.0 THIS IS NO LONGER NEEDED)☆201Updated 10 years ago
- PyQuery-based scraping micro-framework.☆113Updated 2 years ago
- Scrapy extension to control spiders using JSON-RPC☆297Updated 5 years ago
- Mongodb support for scrapy☆101Updated 7 years ago
- Useful test spiders for Scrapy☆183Updated 4 years ago
- Brownant is a web data extracting framework.☆159Updated 7 years ago
- A Redis client library for Twisted Python☆128Updated 2 years ago
- A Python web crawler using Tornado and ZeroMQ☆141Updated 12 years ago
- A Python Library for Simple Models and Containers Persisted in Redis☆300Updated 8 years ago
- an HTTP resource kit for Python☆405Updated 3 years ago
- Redis/Memcached sessions for Tornado☆98Updated 3 years ago
- A implementation of SOAP web services for tornado web server☆92Updated 9 years ago
- PhantomJS Downloader for Scrapy, Yeah!☆94Updated 10 years ago
- Python connector for ElasticSearch - the pythonic way to use ElasticSearch☆606Updated 3 years ago
- Python libraries for XML/JSON RPC using the Tornado framework.☆159Updated last year
- Fast binary [de]serialization of native python types☆33Updated 14 years ago
- Tinman is a Tornado support package including an application wrapper/runner and a set of handy decorators.☆187Updated 10 years ago
- Torneira is a lightweight and rapid web framework build on top of Tornado☆69Updated 11 years ago
- Output scrapy statistics to graphite/carbon☆54Updated 11 years ago
- A tornado-powered python library that provides asynchronous access to elasticsearch☆96Updated 6 years ago
- MongoDB extensions for Scrapy☆44Updated 10 years ago
- A middleware to use random user agent in Scrapy crawler.☆33Updated 11 years ago
- [abandoned] python port of arc90's readability bookmarklet☆537Updated 13 years ago
- Phantompy is a headless WebKit engine with powerful pythonic api build on top of Qt5 Webkit☆614Updated 7 years ago