rmax/scrapy-boilerplate

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/rmax/scrapy-boilerplate)

rmax / scrapy-boilerplate

Small set of utilities to simplify writing Scrapy spiders.

☆50

Alternatives and similar repositories for scrapy-boilerplate

Users that are interested in scrapy-boilerplate are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

julien-duponchelle / scrapy-dot
View on GitHub
Export a graph of link between crawled items by scrapy in dot file format.
☆26Sep 24, 2011Updated 14 years ago
julien-duponchelle / scrapy-couchdb
View on GitHub
A scrapy pipeline for couchdb
☆17Sep 10, 2011Updated 14 years ago
rochacbruno-archive / scrapy_model
View on GitHub
A helper to create web scrapers using scrapy selector in a Model based structure
☆57Dec 26, 2022Updated 3 years ago
darthbear / scrapy-proxynova
View on GitHub
Use scrapy with a list of proxies generated from proxynova.com
☆39Jan 3, 2013Updated 13 years ago
scrapinghub / webpager
View on GitHub
Paginating the web
☆37Feb 11, 2014Updated 12 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
svetlyak40wt / scrapy-useragents
View on GitHub
A middleware to use random user agent in Scrapy crawler.
☆33Dec 15, 2012Updated 13 years ago
pravin / scrapy-tutorial
View on GitHub
This is the code for the tutorial titled "Writing a spider in 10 mins using Scrapy" which can be found in the Weblogs section
☆16Nov 28, 2013Updated 12 years ago
scrapinghub / mdr
View on GitHub
A python library detect and extract listing data from HTML page.
☆110May 5, 2017Updated 9 years ago
mikedingjan-archive / django-grunt-template
View on GitHub
Django project template, integrated GruntJS watch tasks
☆22May 25, 2013Updated 13 years ago
scrapinghub / scaws
View on GitHub
Extensions for using Scrapy on Amazon AWS
☆32Dec 5, 2012Updated 13 years ago
eldarion / django-trending
View on GitHub
an app to track trending objects where "trending" is defined as views per day
☆23Aug 1, 2014Updated 11 years ago
lamby / django-ctemplate
View on GitHub
Compile Django templates to C
☆24Jun 14, 2017Updated 9 years ago
rmax / scrapy-inline-requests
View on GitHub
A decorator to write coroutine-like spider callbacks.
☆109Dec 26, 2022Updated 3 years ago
rmax / scrapydo
View on GitHub
Crochet-based blocking API for Scrapy.
☆47Feb 24, 2017Updated 9 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
TeamHG-Memex / MaybeDont
View on GitHub
A component that tries to avoid downloading duplicate content
☆28Apr 8, 2026Updated 3 months ago
tommilligan / pyqubes
View on GitHub
QubesOS dom0 automation in Python
☆13Aug 3, 2017Updated 8 years ago
dgnest / django-provision
View on GitHub
provisioning a VPS to run Django with Ansible
☆15May 13, 2025Updated last year
christabor / skaffold
View on GitHub
A python auto-scaffolding tool for MVC applications like Django.
☆18Jul 14, 2015Updated 11 years ago
kennethreitz-archive / procs
View on GitHub
Python, Processes, and Prana.
☆226Mar 10, 2015Updated 11 years ago
clasense4 / scrapy-bhinneka-crawler
View on GitHub
Scraping bhinneka.com, just for fun
☆14Jan 28, 2013Updated 13 years ago
scrapinghub / flatson
View on GitHub
Tool to flatten stream of JSON-like objects, configured via schema
☆33Oct 19, 2019Updated 6 years ago
andreruffert / emoji-clarification
View on GitHub
Clarify your words with emojis
☆12Aug 25, 2016Updated 9 years ago
Gidsy / django-threaded-messages
View on GitHub
Rewrite of django-messages to support Facebook-style threaded messaging
☆15Mar 24, 2014Updated 12 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
un33k / django-geoaware
View on GitHub
Django GeoAware provides a middleware as well as a context processor for including GeoIP related info in the session and/or the context o…
☆15Feb 26, 2014Updated 12 years ago
arugifa / europython-2018-workshop
View on GitHub
Material for my workshop at EuroPython 2018
☆12May 2, 2019Updated 7 years ago
FabioTacke / PubliclyVerifiableSecretSharing
View on GitHub
An implementation of Publicly Verifiably Secret Sharing (PVSS) in Swift.
☆13Nov 8, 2017Updated 8 years ago
lnxpgn / scrapy_multiple_spiders
View on GitHub
Using multiple spiders in a Scrapy project
☆10Aug 7, 2015Updated 10 years ago
scholrly / lucene-querybuilder
View on GitHub
A DSL to build Lucene text queries in Python.
☆38Jan 5, 2017Updated 9 years ago
OlivierBlanvillain / crawler
View on GitHub
Blog crawler for the blogforever project.
☆23Jan 31, 2014Updated 12 years ago
barszczmm / django-easy-userena
View on GitHub
Simplified fork of django-userena with less dependencies and hopefully easier setup and customization process
☆15Jun 6, 2012Updated 14 years ago
elena / django-news-podcast
View on GitHub
Django News/Update Podcast
☆24Jul 4, 2022Updated 4 years ago
BuzzFeedNews / namestand
View on GitHub
A Python library for standardizing lists of names, especially database/CSV column–names.
☆23Dec 16, 2019Updated 6 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
zgohr / mezzanine-foundation
View on GitHub
Zurb Foundation theme for Mezzanine, a Django CMS
☆19Mar 20, 2013Updated 13 years ago
danielholmstrom / flask-alchemyview
View on GitHub
Flask ModelView for SQLAlchemy declarative models
☆16Jun 5, 2015Updated 11 years ago
debrouwere / pollster
View on GitHub
Pollster polls for share counts of URLs at regular intervals.
☆47Nov 21, 2015Updated 10 years ago
scrapy-plugins / scrapy-pagestorage
View on GitHub
A scrapy extension to store requests and responses information in storage service
☆27Mar 11, 2022Updated 4 years ago
evansd / django-envsettings
View on GitHub
One-stop shop for configuring 12-factor Django apps
☆10Aug 13, 2015Updated 10 years ago
scrapinghub / product-extraction-benchmark
View on GitHub
☆16Apr 10, 2026Updated 3 months ago
thoas / django-sequere
View on GitHub
A Django application to implement a follow system and a timeline using multiple backends (db, redis, etc.)
☆57Apr 16, 2017Updated 9 years ago