scrapy/scrapyd

A service daemon to run Scrapy spiders

☆3,097

Alternatives and similar repositories for scrapyd

Users who are interested in scrapyd are comparing it to the libraries listed below.

scrapy / scrapyd-client
Command line client for Scrapyd server
☆772 · Feb 27, 2026 · Updated 4 months ago
my8100 / scrapydweb
Web app for Scrapyd cluster management, Scrapy log analysis & visualization, Auto packaging, Timer tasks, Monitor & Alert, and Mobile UI.…
☆3,409 · Feb 19, 2025 · Updated last year
rmax / scrapy-redis
Redis-based components for Scrapy.
☆5,644 · May 19, 2026 · Updated 2 months ago
Gerapy / Gerapy
Distributed Crawler Management Framework Based on Scrapy, Scrapyd, Django and Vue.js
☆3,503 · Jul 4, 2026 · Updated 2 weeks ago
scrapy-plugins / scrapy-splash
Scrapy+Splash for JavaScript integration
☆3,229 · Feb 11, 2025 · Updated last year
DormyMo / SpiderKeeper
admin ui for scrapy/open source scrapinghub
☆2,768 · May 4, 2023 · Updated 3 years ago
scrapy / scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
☆63,293 · Updated this week
scrapinghub / scrapyrt
HTTP API for Scrapy spiders
☆882 · Jun 29, 2026 · Updated 3 weeks ago
djm / python-scrapyd-api
A Python wrapper for working with Scrapyd's API.
☆269 · Jul 31, 2024 · Updated last year
istresearch / scrapy-cluster
This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.
☆1,226 · Nov 7, 2023 · Updated 2 years ago
scrapinghub / splash
Lightweight, scriptable browser as a service with an HTTP API
☆4,190 · Aug 2, 2024 · Updated last year
scrapy-plugins / scrapy-jsonrpc
Scrapy extension to control spiders using JSON-RPC
☆299 · Aug 26, 2019 · Updated 6 years ago
scrapinghub / portia
Visual scraping for Scrapy
☆9,506 · Jun 26, 2024 · Updated 2 years ago
aivarsk / scrapy-proxies
Random proxy middleware for Scrapy
☆1,669 · Oct 1, 2019 · Updated 6 years ago
scrapinghub / frontera
A scalable frontier for web crawlers
☆1,332 · Jun 6, 2025 · Updated last year
geekan / scrapy-examples
Multifarious Scrapy examples. Spiders for alexa / amazon / douban / douyu / github / linkedin etc.
☆3,254 · Nov 3, 2023 · Updated 2 years ago
holgerd77 / django-dynamic-scraper
Creating Scrapy scrapers via the Django admin interface
☆1,158 · Feb 19, 2022 · Updated 4 years ago
binux / pyspider
A Powerful Spider(Web Crawler) System in Python.
☆16,796 · Apr 30, 2024 · Updated 2 years ago
alecxe / scrapy-fake-useragent
Random User-Agent middleware based on fake-useragent
☆688 · Sep 18, 2023 · Updated 2 years ago
scrapinghub / spidermon
Scrapy Extension for monitoring spiders execution.
☆561 · May 28, 2026 · Updated last month
clemfromspace / scrapy-selenium
Scrapy middleware to handle javascript pages using selenium
☆952 · Apr 13, 2026 · Updated 3 months ago
crawlab-team / crawlab
Distributed web crawler admin platform for spiders management regardless of languages and frameworks.
☆12,249 · Feb 10, 2026 · Updated 5 months ago
scrapy-plugins / scrapy-djangoitem
Scrapy extension to write scraped items using Django models
☆502 · Oct 15, 2023 · Updated 2 years ago
my8100 / logparser
A tool for parsing Scrapy log files periodically and incrementally, extending the HTTP JSON API of Scrapyd.
☆93 · Jan 5, 2025 · Updated last year
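gnemoug / distribute_crawler
A distributed web crawler built with Scrapy, Redis, MongoDB, and Graphite: storage is backed by a MongoDB cluster, distribution is implemented with Redis, and crawler status is displayed with Graphite.
☆3,243 · Apr 18, 2017 · Updated 9 years ago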
AccordBox / awesome-scrapy
A curated list of awesome packages, articles, and other cool resources from the Scrapy community.
☆561 · Dec 28, 2022 · Updated 3 years ago
TeamHG-Memex / scrapy-rotating-proxies
use multiple proxies with Scrapy
☆775 · Apr 8, 2026 · Updated 3 months ago
scrapy / parsel
Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors
☆1,343 · Jul 16, 2026 · Updated last week
celery / celery
Distributed Task Queue (development branch)
☆28,715 · Updated this week
sebdah / scrapy-mongodb
MongoDB pipeline for Scrapy. This module supports both MongoDB in standalone setups and replica sets. scrapy-mongodb will insert the item…
☆358 · Apr 6, 2021 · Updated 5 years ago
scrapy-plugins / scrapy-deltafetch
Scrapy spider middleware to ignore requests to pages containing items seen in previous crawls
☆276 · Feb 26, 2025 · Updated last year
scrapy / scrapely
A pure-python HTML screen-scraping library
☆1,884 · Apr 4, 2022 · Updated 4 years ago
scrapy-plugins / scrapy-zyte-smartproxy
Zyte Smart Proxy Manager (formerly Crawlera) middleware for Scrapy
☆363 · May 4, 2026 · Updated 2 months ago
scrapy-plugins / scrapy-playwright
🎭 Playwright integration for Scrapy
☆1,434 · Updated this week
twisted / twisted
Event-driven networking engine written in Python.
☆5,971 · Updated this week
fake-useragent / fake-useragent
Up-to-date simple useragent faker with real world database
☆4,054 · Mar 29, 2026 · Updated 3 months ago
scrapinghub / scrapylib
Collection of Scrapy utilities (extensions, middlewares, pipelines, etc)
☆33 · Feb 22, 2018 · Updated 8 years ago
mher / flower
Real-time monitor and web admin for Celery distributed task queue
☆7,221 · Updated this week
redis / redis-py
Redis Python client
☆13,600 · Updated this week