Repository for Data Engineering Interview Series
☆36Oct 17, 2024Updated last year
Alternatives and similar repositories for data-engineering-interview-series
Users that are interested in data-engineering-interview-series are comparing it to the libraries listed below
Sorting:
- Code for data quality with greatexpectations blog☆13Jul 30, 2024Updated last year
- ☆16Apr 26, 2024Updated last year
- Example repo to create end to end tests for data pipeline.☆25Jun 14, 2024Updated last year
- Code to demonstrate data engineering metadata & logging best practices☆21Mar 12, 2024Updated last year
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/☆40Apr 29, 2024Updated last year
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆13May 24, 2024Updated last year
- Rust And Delta Demo. Explanation and walkthrough on delta-rs☆10Aug 21, 2023Updated 2 years ago
- Step by step instructions to create a production-ready data pipeline☆58Dec 23, 2024Updated last year
- End-to-End ELT data pipeline with Postgres, Airbyte, dbt, Dagster, Snowflake and Metabase☆11Jul 13, 2023Updated 2 years ago
- ☆15Dec 11, 2023Updated 2 years ago
- Repo for CDC with debezium blog post☆29Sep 15, 2024Updated last year
- reating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.☆14Jun 26, 2023Updated 2 years ago
- This repository hosts materials for the Docker for Data Engineers workshop, offering hands-on exercises and resources tailored for data e…☆17May 23, 2024Updated last year
- A custom end-to-end analytics platform for customer churn☆11May 15, 2025Updated 9 months ago
- End to end data engineering project☆58Oct 27, 2022Updated 3 years ago
- ☆16Aug 29, 2023Updated 2 years ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆43Jan 4, 2024Updated 2 years ago
- Simple stream processing pipeline☆110Jun 17, 2024Updated last year
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆101Jun 7, 2024Updated last year
- Sample project to demonstrate data engineering best practices☆204Feb 24, 2024Updated 2 years ago
- Cost Efficient Data Pipelines with DuckDB☆62May 14, 2025Updated 9 months ago
- A project from the ml_ops Zoomcamp (DataTalks) using Semiconductor data☆22Sep 7, 2022Updated 3 years ago
- A demonstration of an ELT (Extract, Load, Transform) pipeline☆31Feb 19, 2024Updated 2 years ago
- Transaction processing & vis pipeline using PySpark Streaming☆30Jul 18, 2024Updated last year
- Sample repo for startdataengineering DE 101 free course☆74Jun 24, 2024Updated last year
- Repository for the dbt Semantic Layer course☆11Nov 13, 2025Updated 3 months ago
- Beyond Vibe Coding. Code, Planning, Documentation and Product Management agents.☆70Feb 20, 2026Updated last week
- Final Project for Data Engineering Zoomcamp Course 2024 🧙🔥☆11Apr 17, 2024Updated last year
- In this project I used ML modeling and data analysis to predict ad clicks and significantly improve ad campaign performance, resulting in…☆12Nov 6, 2023Updated 2 years ago
- This project contains my solution for all the data structures and algorithms on Algo Expert, Hackerrank and Leetcode. This repository is …☆11Jan 24, 2021Updated 5 years ago
- DBT and clickhouse test project with dagster☆12Aug 29, 2023Updated 2 years ago
- A python package for the automatic conversion of EEG datasets to the BIDS standard, with a focus on making the most out of metadata.☆11Feb 26, 2026Updated last week
- ☆11Jun 15, 2023Updated 2 years ago
- Implementation of Mejias et al. 2016: Feedforward and feedback frequency-dependent interactions in a large-scale laminar network of the p…☆10Nov 25, 2025Updated 3 months ago
- This project sets up a real-time data pipeline utilizing Change Data Capture (CDC) to stream changes from a PostgreSQL database to a Clic…☆12May 9, 2024Updated last year
- A Machine Learning project for Machine Learning Internship offered by InternshipStudio.☆12Aug 8, 2021Updated 4 years ago
- This project showcases how to integrate the world of DevOps, focusing on Continuous Integration (CI) and Continuous Deployment (CD) with …☆15Dec 27, 2023Updated 2 years ago
- Content focused, minimal theme for Hugo☆10Sep 12, 2022Updated 3 years ago
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you…☆11Nov 18, 2025Updated 3 months ago