yranjan06 / mini_kafkaLinks
A from scratch Python implementation of Apache Kafka concepts including producers, brokers, topics, consumers, and offset management, built to learn distributed messaging without external dependencies.
☆23Updated 3 months ago
Alternatives and similar repositories for mini_kafka
Users that are interested in mini_kafka are comparing it to the libraries listed below
Sorting:
- Open Source LeetCode for PySpark, Spark, Pandas and DBT/Snowflake☆224Updated 4 months ago
- Code for "Efficient Data Processing in Spark" Course☆346Updated 3 weeks ago
- The next-generation engine for dbt☆536Updated last week
- Repo for saving cheat sheets☆61Updated last year
- Quickstart for any service☆165Updated last week
- Run your dbt Core or dbt Fusion projects as Apache Airflow DAGs and Task Groups with a few lines of code☆1,073Updated last week
- Slow & local data allows you to move fast and deliver business value for the 99.9% of the data challenges.☆315Updated last month
- In this repository we store all materials for dlt workshops, courses, etc.☆233Updated this week
- Port(ish) of Great Expectations to dbt test macros☆1,204Updated 10 months ago
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆119Updated 7 months ago
- Project utilising data from the Age of Empires api at 'https://aoestats.io'☆52Updated 11 months ago
- A dbt package from SELECT to help you monitor Snowflake performance and costs☆249Updated this week
- Tracking and measuring neighborhood and district-level eviction rates in the city of San Francisco.☆139Updated 5 years ago
- Dagster Labs' open-source data platform, built with Dagster.☆412Updated this week
- Educational project on how to build an ETL (Extract, Transform, Load) data pipeline, orchestrated with Airflow.☆338Updated 3 years ago
- ☆161Updated 2 months ago
- A Data Engineering project. Repository for backend infrastructure and Streamlit app files for a Premier League Dashboard.☆249Updated last year
- Dagster University courses☆114Updated 2 weeks ago
- ☆37Updated 8 months ago
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆279Updated last year
- Streaming analytics project with eventsim and Kafka☆12Updated 2 years ago
- Project for "Data pipeline design patterns" blog.☆46Updated last year
- A real-time reddit data streaming pipeline for sentiment analysis of various subreddits☆137Updated 2 years ago
- A curated list of awesome public DBT projects☆153Updated last year
- The easiest way to run Airflow locally, with linting & tests for valid DAGs and Plugins.☆257Updated 4 years ago
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset☆251Updated last month
- This repository helps teach people how to correctly define and create cumulative tables!☆733Updated last year
- Snowflake Connector for Python☆691Updated last week
- A lightweight Python-based tool for extracting and analyzing data column lineage for dbt projects☆186Updated 7 months ago
- PySpark test helper methods with beautiful error messages☆723Updated last month