Distributed crawling/scraping, Kafka And Redis based components for Scrapy
☆45Nov 13, 2020Updated 5 years ago
Alternatives and similar repositories for scrapy-kafka-redis
Users that are interested in scrapy-kafka-redis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Kafka-based components for Scrapy☆78Apr 10, 2018Updated 7 years ago
- scrapy-redis-expiredupefilter是基于scrapy-redis修改来的一款scrapy分布式爬虫框架,它支持为请求指纹设置生命周期,请求指纹生命周期结束后将在不影响其他指纹的情况下自动清除。☆10Aug 6, 2019Updated 6 years ago
- Scrapy and Kafka☆14Feb 7, 2018Updated 8 years ago
- 个人博客☆13Feb 2, 2023Updated 3 years ago
- Nintendo Switch 云游戏!☆12May 8, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Kafka event forwarder build on top of Elastic Beats platform☆11Apr 29, 2019Updated 6 years ago
- ☆11Jun 14, 2020Updated 5 years ago
- A RabbitMQ Scheduler for Scrapy☆87Aug 9, 2022Updated 3 years ago
- 对微信网页授权获取用户信息的封装☆10Jul 30, 2015Updated 10 years ago
- 一个基于 HttpCanary 和 Python 的爬虫项目☆21May 2, 2023Updated 2 years ago
- A Simple Tool to Distribute/Administrate Your Scripts☆12Sep 10, 2015Updated 10 years ago
- ☆17Jul 14, 2017Updated 8 years ago
- 基于Redis的Bloomfilter去重,并将其扩展到Scrapy框架。☆347Feb 26, 2023Updated 3 years ago
- 批处理延迟任务队列☆53Aug 8, 2013Updated 12 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- 《App安全实战指南:Android和iOS App的安全攻防与合规 》书中的示例代码和相关工具☆36Sep 2, 2024Updated last year
- Auto Extractor Module☆334Aug 19, 2024Updated last year
- Scrapy + Puppeteer☆110Jun 11, 2021Updated 4 years ago
- 可视化任务调度系统,精简到一个二进制文件 (Web visual task scheduler system , yes ! just one binary solve all the problems !)☆194Mar 21, 2026Updated last week
- Helm Chart for Samba4☆10Oct 13, 2021Updated 4 years ago
- The classic movies redux with machine learning using TensorFlow and Keras.☆11Feb 12, 2019Updated 7 years ago
- 一款优秀的在线文件预览解决方案,,使用主流springboot+maven搭建,支持doc、docx、ppt、pptx、xls、xlsx、zip、rar、mp4、mp3以及众多类文本如txt、html、xml、java、properties、sql、js、md、json、c…☆16Dec 6, 2022Updated 3 years ago
- HTML DOM Query Language for XGo☆42Feb 16, 2026Updated last month
- 小蓝本(https://www.xiaolanben.com/) 爬虫的 h_sign 签名JSRPC实现。nodejs 补环境也实现了☆13Apr 30, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A simple sync tool to sync task from Workflowy to Teambition☆32Oct 4, 2017Updated 8 years ago
- Prediction model for Kaggle/Rossmann competition.☆13Nov 23, 2015Updated 10 years ago
- 百万英雄/冲顶大会/知识超人 答题助手 瞬间使用Chrome打开百度☆101Jan 21, 2018Updated 8 years ago
- 书籍《Python3 反爬虫原理与绕过实战》配套代码☆628Oct 25, 2021Updated 4 years ago
- This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.☆1,230Nov 7, 2023Updated 2 years ago
- 安卓应用层抓包通杀脚本☆10Jan 4, 2021Updated 5 years ago
- Pyppeteer integration for Scrapy☆58Feb 26, 2021Updated 5 years ago
- A chrome extension to get XPath of list items in webpage easily.☆35Mar 11, 2022Updated 4 years ago
- Scrapy Pyppeteer Demo☆24Jul 13, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Scrapy Universal Spider☆56Aug 26, 2017Updated 8 years ago
- Strong, Simple, and Precise, (and now async!) security for Sanic APIs☆14Jul 23, 2024Updated last year
- homepage☆10Feb 15, 2023Updated 3 years ago
- Sync your workflowy's items into evernote☆32Dec 8, 2022Updated 3 years ago
- 一个纯 Go 实现的游戏手柄网络转发工具(实际上用的库并不纯)☆13May 8, 2020Updated 5 years ago
- ScrapyDemo : Redis MySQLdb logging IngoreHttpRequestMiddleware UserAgentMiddleware HttpProxyMiddleware rules☆38Jun 28, 2016Updated 9 years ago
- 基于asyncio与aiohttp的异步协程爬虫框架 欢迎Star☆35Oct 25, 2019Updated 6 years ago