Deploy a complete data stack in just a couple of minutes.
☆15Mar 6, 2024Updated last year
Alternatives and similar repositories for mini-modern-data-stack
Users that are interested in mini-modern-data-stack are comparing it to the libraries listed below
Sorting:
- Companion repository for the "Streamlining AWS Glue CI/CD — A Comprehensive Blueprint" blog post☆11Nov 8, 2024Updated last year
- Beyond Vibe Coding. Code, Planning, Documentation and Product Management agents.☆70Feb 20, 2026Updated last week
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/☆40Apr 29, 2024Updated last year
- A script/docker that automatically translates PDFs using the DeepL API☆11Jan 18, 2026Updated last month
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆13May 24, 2024Updated last year
- This project showcases how to integrate the world of DevOps, focusing on Continuous Integration (CI) and Continuous Deployment (CD) with …☆15Dec 27, 2023Updated 2 years ago
- This project sets up a real-time data pipeline utilizing Change Data Capture (CDC) to stream changes from a PostgreSQL database to a Clic…☆12May 9, 2024Updated last year
- DBT and clickhouse test project with dagster☆12Aug 29, 2023Updated 2 years ago
- ☆17Sep 20, 2021Updated 4 years ago
- ☆10Jan 24, 2023Updated 3 years ago
- Building a poor man's data lake: Exploring the Power of Polars and Delta Lake☆11Dec 6, 2025Updated 2 months ago
- End-to-End ELT data pipeline with Postgres, Airbyte, dbt, Dagster, Snowflake and Metabase☆11Jul 13, 2023Updated 2 years ago
- Data Observability for Data Engineering, published by Packt Publishing☆11Jan 24, 2025Updated last year
- Roadmap for all those who want to get a kick start as Data Scientist.☆11Feb 2, 2022Updated 4 years ago
- Realistic OLTP data simulator for CDC testing with Debezium☆17Nov 5, 2025Updated 4 months ago
- dbt-databend adapter plugin☆10May 30, 2024Updated last year
- The unique data management platform for Julia☆16Apr 25, 2022Updated 3 years ago
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you…☆11Nov 18, 2025Updated 3 months ago
- Para entender e aprender um pouco sobre o Apache Kafka.https://www.youtube.com/channel/UC3pevgVzUWKo5CoWdhDsoHw☆13Jan 8, 2026Updated last month
- Plane moment analysis with Apache Flink complex event processing☆17Jun 14, 2025Updated 8 months ago
- This is the HTML-CSS source code to build my personal website.☆10Nov 13, 2025Updated 3 months ago
- Data Analysis and Image Processing Python Course☆12Nov 4, 2014Updated 11 years ago
- The "World Data Report" is a Power BI project that offers a detailed overview of global data, covering weather, geographical, demographic…☆15Nov 30, 2025Updated 3 months ago
- dlt-dagster-demo☆13Nov 6, 2023Updated 2 years ago
- Celery workers as independent microservices deployed using Docker Swarm.☆11Dec 4, 2020Updated 5 years ago
- This Repo contains tools that allow us to import, clean, manipulate, and visualize data —Includes Python libraries, like pandas, NumPy, M…☆13Jul 7, 2024Updated last year
- dbt package for EDU's Ed-Fi data warehouse☆17Updated this week
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆10Feb 2, 2024Updated 2 years ago
- ☆15Dec 11, 2023Updated 2 years ago
- End-to-end data platform leveraging the Modern data stack☆52Apr 10, 2024Updated last year
- This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project de…☆11Nov 18, 2023Updated 2 years ago
- An open-source plugin for showing RPG-like dialogue for your Roblox game☆12Jun 29, 2025Updated 8 months ago
- ☆17Dec 12, 2025Updated 2 months ago
- Thesis: Measure The Speed Of News Spread in Social Networks For Real-Time Fake News Detection☆10Jun 7, 2021Updated 4 years ago
- Rust + Python Lake House Health Analyzer | Detect • Diagnose • Optimize • Flow☆65Oct 20, 2025Updated 4 months ago
- ☆16Apr 26, 2024Updated last year
- ☆15Apr 29, 2024Updated last year
- A custom end-to-end analytics platform for customer churn☆11May 15, 2025Updated 9 months ago
- Practice notebooks for NumPy, Pandas, matplotlib, basic machine learning etc.☆13Nov 20, 2017Updated 8 years ago