Insutanto/scrapy-distributed

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Insutanto/scrapy-distributed)

Insutanto / scrapy-distributed

A series of distributed components for Scrapy. Including RabbitMQ-based components, Kafka-based components, and RedisBloom-based components for Scrapy.

☆62

Alternatives and similar repositories for scrapy-distributed

Users that are interested in scrapy-distributed are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

php-tui / cli-parser
View on GitHub
Type-safe CLI argument parser
☆11Jul 29, 2024Updated last year
Mopolo / MagicConstant
View on GitHub
PHP Magic Constants, even more powerful than an Enum
☆14Jan 26, 2026Updated 6 months ago
DEVSENSE / Parsers
View on GitHub
☆25Jun 24, 2026Updated last month
ConlinH / aio-scrapy
View on GitHub
Implement scrapy with asyncio
☆72Updated this week
markmelnic / stealthenium
View on GitHub
Run selenium undetected.
☆32Mar 6, 2026Updated 4 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
ydf0509 / realtime_web_logs
View on GitHub
pip install realtime_web_logs 文件日志实时显示到web页面。附带全系统硬盘的文件浏览下载功能。支持日志显示自动滚动和暂停。
☆13Jul 1, 2020Updated 6 years ago
q-m / scrapyd-k8s
View on GitHub
Scrapyd on container infrastructure
☆16May 29, 2026Updated 2 months ago
bgspiders / sekiro_js
View on GitHub
js_rpc通杀版本
☆14Oct 9, 2021Updated 4 years ago
elfgzp / python-consul-demo
View on GitHub
🐍本项目为 Consul 的使用 Demo
☆13Dec 8, 2022Updated 3 years ago
scravy / pysparkextra
View on GitHub
☆10Jun 29, 2021Updated 5 years ago
LiuXingMing / Scrapy_Redis_Bloomfilter
View on GitHub
基于Redis的Bloomfilter去重，并将其扩展到Scrapy框架。
☆347Feb 26, 2023Updated 3 years ago
use-py / usepy
View on GitHub
一个简单方便的Python工具包
☆17Mar 15, 2026Updated 4 months ago
istresearch / scrapy-cluster
View on GitHub
This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.
☆1,226Nov 7, 2023Updated 2 years ago
scrapedia / scrapy-pipelines
View on GitHub
A collection of pipelines for Scrapy
☆16Apr 27, 2026Updated 3 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
SachinAgarwal1337 / Laravel-Generators
View on GitHub
This package includes Artisan Commands to create directory structure.
☆10Mar 26, 2022Updated 4 years ago
simonw / llm-templates-github
View on GitHub
Research prototype for new register_template_loaders LLM plugin hook
☆20Apr 7, 2025Updated last year
cevoaustralia / glue-vscode
View on GitHub
Local Development of AWS Glue with Docker and Visual Studio Code
☆14Nov 29, 2021Updated 4 years ago
gumblex / chinesename
View on GitHub
Generate Chinese name according to statistic model.
☆13Sep 4, 2015Updated 10 years ago
kingname / SifouSource
View on GitHub
Python 业务开发常见错误案例集配套源代码
☆10Dec 19, 2020Updated 5 years ago
mic1on / puppeteer-render
View on GitHub
基于puppeteer和NodeJS的服务端渲染，提供Docker一键部署及API调用接口。
☆19Aug 30, 2022Updated 3 years ago
Gerapy / Gerapy
View on GitHub
Distributed Crawler Management Framework Based on Scrapy, Scrapyd, Django and Vue.js
☆3,506Jul 4, 2026Updated 3 weeks ago
Gerapy / GerapyPlaywright
View on GitHub
Downloader Middleware to support Playwright in Scrapy & Gerapy
☆111Mar 6, 2022Updated 4 years ago
cilame / yidun_icon
View on GitHub
易盾图标识别，包含定位以及点选顺序的识别，定位 pytorch 模型大小只有3M，执行速度极快。内附代码和测试用例，直接使用即可测试。定位准确率 95% 以上，识别用的sift算法，测试通过率大概 50%。
☆33Jun 2, 2020Updated 6 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
aws-samples / redshift-streaming-ingestion-patterns
View on GitHub
This is a collecton of CDK projects to show how to load data from streaming services into Amazon Redshift.
☆13Sep 10, 2024Updated last year
clemfromspace / scrapy-puppeteer
View on GitHub
Scrapy + Puppeteer
☆110Jun 11, 2021Updated 5 years ago
sonata-project / SonataTranslationBundle
View on GitHub
SonataTranslationBundle
☆77May 3, 2026Updated 2 months ago
Germey / JMeterMonitor
View on GitHub
JMeter Tester with Influxdb and Grafana
☆14Apr 10, 2020Updated 6 years ago
Python3WebSpider / ScrapyPyppeteer
View on GitHub
Scrapy Pyppeteer Demo
☆12Jul 30, 2020Updated 5 years ago
zwjjiaozhu / gitchat_download
View on GitHub
gitchat课程下载工具
☆30Jul 11, 2020Updated 6 years ago
mic1on / fastrpc
View on GitHub
这是一个基于 FastAPI 的浏览器 RPC 服务端
☆59Aug 17, 2023Updated 2 years ago
stav / scrapybox
View on GitHub
Scrapy GUI
☆12Feb 26, 2021Updated 5 years ago
bytebuff / aioScrapy
View on GitHub
基于asyncio与aiohttp的异步协程爬虫框架欢迎Star
☆35Oct 25, 2019Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
JSREI / javascript-window-listener-library
View on GitHub
javascript逆向开发基础组件，监听window的变化
☆18Feb 11, 2024Updated 2 years ago
kairyou / user-agent-switcher
View on GitHub
User Agent Switcher + for Chrome
☆11Apr 9, 2022Updated 4 years ago
azwpayne / JsHookScript
View on GitHub
JsHookScript, All the hook scripts I know
☆19Jun 19, 2026Updated last month
MuggleK / CrawlersTools
View on GitHub
Tools for Crawlers
☆22Dec 25, 2023Updated 2 years ago
delatbabel / viewpages
View on GitHub
Support view/rendering of Laravel pages and templates from a database.
☆12Nov 22, 2017Updated 8 years ago
j-m-li / lldb-trace-call
View on GitHub
Trace function calls using lldb
☆13Jul 5, 2021Updated 5 years ago
hacklee / laravel5-multi-auth
View on GitHub
laravel5 multi auth
☆11Mar 4, 2015Updated 11 years ago