jmg / crawley
Pythonic Crawling / Scraping Framework based on Non Blocking I/O operations.
☆189Updated 2 years ago
Alternatives and similar repositories for crawley
Users that are interested in crawley are comparing it to the libraries listed below
 - Python library of web-related functions☆411Updated 2 months ago
 - Web Crawling UI and HTTP API, based on Scrapy and Tornado☆160Updated 2 years ago
 - Collection of Scrapy utilities (extensions, middlewares, pipelines, etc)☆32Updated 7 years ago
 - Useful test spiders for Scrapy☆185Updated 5 years ago
 - Celery integration for Flask (SINCE CELERY 3.0 THIS IS NO LONGER NEEDED)☆200Updated 11 years ago
 - ☆143Updated 9 years ago
 - Sourcecode for the bf3 developer news aggregator.☆84Updated 14 years ago
 - Scrapy extension to control spiders using JSON-RPC☆299Updated 6 years ago
 - A Python web crawler using Tornado and ZeroMQ☆140Updated 13 years ago
 - Mongodb support for scrapy☆101Updated 8 years ago
 - PyQuery-based scraping micro-framework.☆118Updated 3 years ago
 - A Redis client library for Twisted Python☆127Updated 3 years ago
 - A Python Library for Simple Models and Containers Persisted in Redis☆300Updated 9 years ago
 - python elasticsearch client☆361Updated 3 years ago
 - Torneira is a lightweight and rapid web framework built on top of Tornado☆69Updated 12 years ago
 - an HTTP resource kit for Python☆403Updated 4 years ago
 - A Scrapy pipeline which sends items to an Elasticsearch server☆98Updated 7 years ago
 - A Django based search engine powered by CouchDB, celery and whoosh.☆49Updated 9 years ago
 - MongoDB pipeline for Scrapy. This module supports both MongoDB in standalone setups and replica sets. scrapy-mongodb will insert the item…☆358Updated 4 years ago
 - asynchronous python driver for mongo☆340Updated 2 weeks ago
 - [abandoned] python port of arc90's readability bookmarklet☆542Updated 14 years ago
 - Python/JavaScript bridge module, making use of Mozilla's spidermonkey JavaScript implementation.☆304Updated 8 years ago
 - Provides simple but efficient admin UI.☆125Updated 10 years ago
 - Tinman is a Tornado support package including an application wrapper/runner and a set of handy decorators.☆183Updated 11 years ago
 - A middleware to use random user agent in Scrapy crawler.☆33Updated 12 years ago
 - MongoDB python logging handler. Python centralized logging made easy.☆197Updated 12 years ago
 - Greenlet-based event I/O Framework for Python☆579Updated 10 years ago
 - Requests + futures = <3 - a grequests fork, original code: https://github.com/kennethreitz/grequests☆63Updated 10 years ago
 - Scrapy Middleware to set a random User-Agent for every Request.☆202Updated 6 years ago
 - Turns python objects into mongo objects and vice versa☆137Updated last year