jmg/crawley

Pythonic Crawling / Scraping Framework based on Non Blocking I/O operations.

☆194

Alternatives and similar repositories for crawley

Users who are interested in crawley are comparing it to the libraries listed below.

qinxuye / cola
A high-level distributed crawling framework.
☆1,500 · Jul 31, 2022 · Updated 3 years ago
matiasb / demiurge
PyQuery-based scraping micro-framework.
☆118 · Jan 14, 2022 · Updated 4 years ago
weizetao / spider-roach
Distributed targeted crawling cluster
☆71 · Sep 4, 2017 · Updated 8 years ago
lorien / grab
Web Scraping Framework
☆2,461 · Sep 19, 2025 · Updated 10 months ago
DanMcInerney / mailspy
Catch IMAP/POP passwords and see incoming and outgoing messages
☆16 · Sep 15, 2013 · Updated 12 years ago
scrapinghub / scrapy-mosquitera
Restrict crawl and scraping scope using matchers.
☆26 · Jun 8, 2016 · Updated 10 years ago
jeffkit / SOAPpy
☆25 · May 7, 2020 · Updated 6 years ago
pubyun / puppet
puppet frame to manage our server
☆17 · Jun 7, 2012 · Updated 14 years ago
TeamHG-Memex / scrapy-kafka-export
Scrapy extension which writes crawled items to Kafka
☆31 · Apr 8, 2026 · Updated 3 months ago
scrapinghub / scmongo
MongoDB extensions for Scrapy
☆44 · Oct 2, 2014 · Updated 11 years ago
gisce / sentry-irc
A plugin for Sentry that logs errors to an IRC room.
☆19 · Nov 18, 2016 · Updated 9 years ago
indygemma / indygo
pastescript template for a complete django project with pip+virtualenv, fabric, a gevent-based wsgi server and various helpers scripts. R…
☆13 · Sep 29, 2010 · Updated 15 years ago
willroberts / berrystats
Flask/Jinja2 web app to report various stats about a Raspberry Pi
☆26 · Sep 27, 2022 · Updated 3 years ago
scrapinghub / portia
Visual scraping for Scrapy
☆9,505 · Jun 26, 2024 · Updated 2 years ago
mitsuhiko / python-juggernaut
Python client library for juggernaut
☆25 · Dec 6, 2011 · Updated 14 years ago
scrapinghub / scrapylib
Collection of Scrapy utilities (extensions, middlewares, pipelines, etc)
☆33 · Feb 22, 2018 · Updated 8 years ago
ooici / elasticpy
Python client for ElasticSearch
☆17 · Jul 14, 2015 · Updated 11 years ago
llonchj / django-tastypie-elasticsearch
ElasticSearch support for django-tastypie
☆28 · Nov 2, 2013 · Updated 12 years ago
amitu / djangothis
SimpleHTTPServer with Django steroids
☆49 · Mar 10, 2015 · Updated 11 years ago
scrapy / scrapely
A pure-python HTML screen-scraping library
☆1,884 · Apr 4, 2022 · Updated 4 years ago
retresco / Spyder
A Python web crawler using Tornado and ZeroMQ
☆139 · May 9, 2012 · Updated 14 years ago
quokkaproject / flask-htmlbuilder
Builds HTML from Python (recovered from local installation since original was deleted)
☆11 · Dec 26, 2022 · Updated 3 years ago
Gidsy / django-geoip-utils
GeoIp data and helper function. Facilitates install and handling of the datasets
☆20 · Jul 25, 2018 · Updated 8 years ago
atupal / ccrawler
A distributed crawler that uses Celery.
☆15 · Oct 15, 2014 · Updated 11 years ago
jespino / django-lot
Django Login over Token
☆25 · Jun 23, 2021 · Updated 5 years ago
playfire / django-dynamic-subdomains
Dynamic (and static) subdomain support for Django
☆53 · Feb 26, 2013 · Updated 13 years ago
mbraak / django_pony_forms
Django pony forms
☆17 · Jul 7, 2022 · Updated 4 years ago
malcolmt / django-multidb-patterns
Demonstration code and slides for a talk about Django's multi-database support. Originally presented at DjangoCon-US, September 2010.
☆26 · Sep 27, 2010 · Updated 15 years ago
heroku / redo
pipelined erlang redis client
☆19 · Jan 17, 2019 · Updated 7 years ago
buriy / django-containers
"Containers" are html-only widgets for Django. Invaluable for complex designs.
☆21 · May 17, 2010 · Updated 16 years ago
benoitc / hroute
simple HTTP proxy based on tproxy
☆27 · May 5, 2011 · Updated 15 years ago
douban / brownant
Brownant is a web data extracting framework.
☆157 · Mar 3, 2017 · Updated 9 years ago
kenkam / msgbrd
Message board on Flask and Redis
☆18 · Jun 8, 2013 · Updated 13 years ago
alexanderGugel / arc-js
An Adaptive Replacement Cache (ARC) written in JavaScript.
☆11 · Jul 18, 2015 · Updated 11 years ago
tleyden / open-ocr-client
Client library for OpenOCR
☆32 · Dec 3, 2014 · Updated 11 years ago
mitsuhiko / speaklater
Lazy strings for Python
☆61 · Jan 7, 2016 · Updated 10 years ago
ericmoritz / riak_crdt
A Riak loader for CRDTs
☆22 · Dec 1, 2011 · Updated 14 years ago
muhuk / django-inviting
Registration through invitations
☆58 · Mar 23, 2014 · Updated 12 years ago
howie6879 / ruia
Async Python 3.6+ web scraping micro-framework based on asyncio
☆1,740 · Jul 1, 2023 · Updated 3 years ago