TeamHG-Memex/scrapy-kafka-export

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TeamHG-Memex/scrapy-kafka-export)

TeamHG-Memex / scrapy-kafka-export

Scrapy extension which writes crawled items to Kafka

☆31

Alternatives and similar repositories for scrapy-kafka-export

Users that are interested in scrapy-kafka-export are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dfdeshom / scrapy-kafka
View on GitHub
Kafka-based components for Scrapy
☆78Apr 10, 2018Updated 8 years ago
TeamHG-Memex / scrapy-dockerhub
View on GitHub
[UNMAINTAINED] Deploy, run and monitor your Scrapy spiders.
☆12Apr 8, 2026Updated 3 months ago
scrapinghub / exporters
View on GitHub
Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations
☆39May 21, 2024Updated 2 years ago
TeamHG-Memex / url-summary
View on GitHub
Show summary of a large number of URLs in a Jupyter Notebook
☆19Apr 8, 2026Updated 3 months ago
yorks / mpfhandler
View on GitHub
a mutiple processes timed rotate logging file handler(base logging.RotatingFileHandler, ConcurrentLogHandler)
☆22Dec 16, 2022Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
stummjr / scrapy-fieldstats
View on GitHub
A Scrapy extension to log items coverage when the spider shuts down
☆18Apr 11, 2020Updated 6 years ago
scrapinghub / aduana
View on GitHub
Frontera backend to guide a crawl using PageRank, HITS or other ranking algorithms based on the link structure of the web graph, even whe…
☆54May 21, 2024Updated 2 years ago
dumganhar / learn-travis
View on GitHub
☆13Jul 16, 2013Updated 13 years ago
scrapy / pypydispatcher
View on GitHub
A fork of http://pydispatcher.sourceforge.net/ with PyPy support
☆16Jul 3, 2017Updated 9 years ago
scrapy-plugins / scrapy-jsonschema
View on GitHub
Scrapy schema validation pipeline and Item builder using JSON Schema
☆45Mar 26, 2021Updated 5 years ago
scrapinghub / scrapy-autounit
View on GitHub
Automatic unit test generation for Scrapy.
☆58Jul 12, 2021Updated 5 years ago
rfyiamcool / redis-cluster-dockerfile
View on GitHub
redis-cluster-dockerfile
☆11May 18, 2015Updated 11 years ago
pujiaxin33 / JXTransition
View on GitHub
自定义转场动画
☆12Dec 9, 2015Updated 10 years ago
istresearch / scrapy-cluster
View on GitHub
This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.
☆1,226Nov 7, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
TeamHG-Memex / Formasaurus
View on GitHub
Formasaurus tells you the type of an HTML form and its fields using machine learning
☆121Apr 8, 2026Updated 3 months ago
un4gt / flasky-cli
View on GitHub
CLI to start Flask projects
☆12Aug 24, 2024Updated last year
scrapinghub / scrapyrt
View on GitHub
HTTP API for Scrapy spiders
☆882Jun 29, 2026Updated last month
anasdjebbari / Python-Movie-Recommendation
View on GitHub
Flask based Movie Recommendation System
☆12May 1, 2023Updated 3 years ago
piglei / pycronic
View on GitHub
A crontab script wrapper written in python
☆19Oct 12, 2021Updated 4 years ago
GridPlus / cryptobridge-client
View on GitHub
A client to bridge two EVM blockchain networks
☆12Feb 16, 2018Updated 8 years ago
ddviplinux / crawler-framework
View on GitHub
分布式爬虫框架,基于webdrvier模拟用户请求,kafka消息传递,分布式网页存储使用hbase,task异步任务多线程解析,提供基础服务如:proxy ip服务和号码验证服务等, proxy page使用H5和we版进行接入
☆13Dec 18, 2015Updated 10 years ago
Verubato / framesort
View on GitHub
A simple WoW add-on that sorts party, arena, and raid frames.
☆16Jul 2, 2026Updated 3 weeks ago
timelfrink / flask-api
View on GitHub
In this repo I show how to simple create an API for your machine learning models in Python
☆12Nov 28, 2018Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
bestswifter / iOS-Source-Code-Analyze
View on GitHub
深入解析 iOS 开源项目
☆11Aug 31, 2016Updated 9 years ago
DeveloperErenLiu / EasyNSCoding
View on GitHub
简单方便的实现NSCoding、NSCopying协议，一行代码即可实现。希望各位能帮忙点个Star，谢谢！
☆11Jun 7, 2018Updated 8 years ago
chiahsien / NSTimer-Block
View on GitHub
Add block ability to NSTimer to avoid common retain cycle issues.
☆11Apr 5, 2017Updated 9 years ago
scrapy-plugins / scrapy-jsonrpc
View on GitHub
Scrapy extension to control spiders using JSON-RPC
☆299Aug 26, 2019Updated 6 years ago
shyam1998 / Movie-Recommendation-System-GUI
View on GitHub
movie-recommendation-system-GUI
☆10Aug 15, 2020Updated 5 years ago
scrapy / scrapy-bench
View on GitHub
A CLI for benchmarking Scrapy.
☆32Jun 28, 2025Updated last year
lenskit / lk-demo-experiment
View on GitHub
Example project for running LensKit experiments
☆13Apr 20, 2026Updated 3 months ago
scrapinghub / skinfer
View on GitHub
Skinfer is a tool for inferring and merging JSON schemas
☆141Apr 24, 2024Updated 2 years ago
MirkoRossini / django-redis-engine
View on GitHub
Django Redis engine for Django Nonrel
☆48May 24, 2011Updated 15 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
lemoncode21 / Fastapi-JWTtoken
View on GitHub
☆10Nov 21, 2022Updated 3 years ago
fraglab / nameko-proxy
View on GitHub
Standalone async proxy to communicate with Nameko microservices
☆12Sep 6, 2019Updated 6 years ago
ShaneHeX / sicp-python3
View on GitHub
sicp的python3版本
☆10Jun 14, 2017Updated 9 years ago
backslash112 / book_scraper_python
View on GitHub
A demo to use the BeautifulSoup Python package to get the book informations from websites
☆15Oct 2, 2020Updated 5 years ago
MnogoByte / celery-graceful-stop
View on GitHub
Celery plugin provides ability of graceful worker stopping.
☆17Mar 29, 2016Updated 10 years ago
enclosed-money / contracts
View on GitHub
☆14Sep 20, 2022Updated 3 years ago
GemTrackerClub / GemTracker
View on GitHub
🦄 Instant notification about any changes from decentralized world straight to you (newly added tokens, whales movements, token transfers…
☆15Jan 18, 2021Updated 5 years ago