This repository hosts materials for the Docker for Data Engineers workshop, offering hands-on exercises and resources tailored for data engineering professionals.
☆17May 23, 2024Updated last year
Alternatives and similar repositories for DockerForDataEngineers
Users that are interested in DockerForDataEngineers are comparing it to the libraries listed below
Sorting:
- This course is designed to provide learners with the fundamental skills needed for data engineering using Python. The objective is to int…☆24Aug 15, 2024Updated last year
- Ecommerce Realtime Data Pipeline (Data Modeling, Workflow Orchestration, Change Data Capture, Analytical Database and Dashboarding)☆64Mar 9, 2024Updated last year
- This project sets up a real-time data pipeline utilizing Change Data Capture (CDC) to stream changes from a PostgreSQL database to a Clic…☆12May 9, 2024Updated last year
- A custom end-to-end analytics platform for customer churn☆11May 15, 2025Updated 9 months ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆44Jan 4, 2024Updated 2 years ago
- This project contains my solution for all the data structures and algorithms on Algo Expert, Hackerrank and Leetcode. This repository is …☆11Jan 24, 2021Updated 5 years ago
- In this project I used ML modeling and data analysis to predict ad clicks and significantly improve ad campaign performance, resulting in…☆12Nov 6, 2023Updated 2 years ago
- This is a demo project to compare two web scrapping frameworks, Playwright and Selenium and using the new Pipelining tool Dagster☆15Sep 9, 2021Updated 4 years ago
- Repository for the dbt Semantic Layer course☆11Nov 13, 2025Updated 3 months ago
- Beyond Vibe Coding. Code, Planning, Documentation and Product Management agents.☆70Feb 20, 2026Updated last week
- This project showcases how to integrate the world of DevOps, focusing on Continuous Integration (CI) and Continuous Deployment (CD) with …☆15Dec 27, 2023Updated 2 years ago
- A Machine Learning project for Machine Learning Internship offered by InternshipStudio.☆12Aug 8, 2021Updated 4 years ago
- DBT and clickhouse test project with dagster☆12Aug 29, 2023Updated 2 years ago
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆13May 24, 2024Updated last year
- This project was to predict the number of requests for the Tepsi company competition. In 2018, this project was done with Python, FastAPI…☆12May 3, 2025Updated 9 months ago
- In this tutorial, we have added step-by-step instructions to build your own AI chatbot with ChatGPT API. From setting up tools to install…☆11Apr 13, 2023Updated 2 years ago
- Rust And Delta Demo. Explanation and walkthrough on delta-rs☆10Aug 21, 2023Updated 2 years ago
- ☆11Aug 20, 2024Updated last year
- Data Observability for Data Engineering, published by Packt Publishing☆11Jan 24, 2025Updated last year
- This repository contains a diverse collection of case studies and use cases commonly asked in data science interviews across different co…☆15Apr 14, 2024Updated last year
- Built a Data Pipeline for a Retail store using AWS services that collects data from its transactional database (OLTP) in Snowflake and tr…☆11May 25, 2023Updated 2 years ago
- Building a poor man's data lake: Exploring the Power of Polars and Delta Lake☆11Dec 6, 2025Updated 2 months ago
- End-to-End ELT data pipeline with Postgres, Airbyte, dbt, Dagster, Snowflake and Metabase☆11Jul 13, 2023Updated 2 years ago
- Forecasting Netflix Customer Retention based on Gaussian Process Regression☆14Jul 22, 2023Updated 2 years ago
- dbt-databend adapter plugin☆10May 30, 2024Updated last year
- ☆12Jan 24, 2025Updated last year
- Trying out the Dataframe Polars library with Delta Lake ... feat Python.☆12Jan 29, 2025Updated last year
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you…☆11Nov 18, 2025Updated 3 months ago
- In this project, we have to create a predictive model which allows the company to maximize the profit of the next marketing campaign☆12Oct 18, 2025Updated 4 months ago
- ☆15Dec 11, 2023Updated 2 years ago
- National Stock Exchange (India) (nseindia.com) Web-Scraping For collecting data for real-time visualization and machine learning projects…☆16Aug 11, 2024Updated last year
- dlt-dagster-demo☆13Nov 6, 2023Updated 2 years ago
- FastAPI CLI is a command-line tool designed to help developers quickly generate a structured project file system for FastAPI applications…☆12Feb 3, 2025Updated last year
- Tapsi Ride Demand Prediction☆12Jan 13, 2026Updated last month
- the full pipeline for model retraining with fastapi and github actions☆16Jul 5, 2024Updated last year
- A fully featured banking API built with FastAPI,Docker,Celery,Redis,RabbitMQ with an AI/ML transaction analysis and fraud detection syste…☆18Sep 4, 2025Updated 5 months ago
- dbt package for EDU's Ed-Fi data warehouse☆17Feb 17, 2026Updated last week
- A portable Datamart and Business Intelligence suite built with Docker, Mage, dbt, DuckDB and Superset☆54Dec 13, 2025Updated 2 months ago
- Numpy main repository☆13Oct 22, 2025Updated 4 months ago