Repository for Data Engineering Interview Series
☆36 · Oct 17, 2024 · Updated last year
Alternatives and similar repositories for data-engineering-interview-series
Users that are interested in data-engineering-interview-series are comparing it to the libraries listed below.
- Code for data quality with greatexpectations blog ☆13 · Jul 30, 2024 · Updated last year
- ☆16 · Apr 26, 2024 · Updated last year
- Example repo to create end to end tests for data pipeline. ☆25 · Jun 14, 2024 · Updated last year
- Code for "Advanced data transformations in SQL" free live workshop ☆92 · May 5, 2025 · Updated 10 months ago
- Repo for CDC with debezium blog post ☆29 · Sep 15, 2024 · Updated last year
- Step by step instructions to create a production-ready data pipeline ☆58 · Dec 23, 2024 · Updated last year
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/ ☆13 · May 24, 2024 · Updated last year
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/ ☆40 · Apr 29, 2024 · Updated last year
- Code to demonstrate data engineering metadata & logging best practices ☆21 · Mar 12, 2024 · Updated 2 years ago
- ☆14 · Dec 11, 2023 · Updated 2 years ago
- A custom end-to-end analytics platform for customer churn ☆11 · May 15, 2025 · Updated 10 months ago
- Creating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash. ☆15 · Jun 26, 2023 · Updated 2 years ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA… ☆43 · Jan 4, 2024 · Updated 2 years ago
- 65 Articles on SQL: A Comprehensive Guide to Mastering Advanced SQL ☆11 · Jun 7, 2023 · Updated 2 years ago
- End-to-End ELT data pipeline with Postgres, Airbyte, dbt, Dagster, Snowflake and Metabase ☆11 · Jul 13, 2023 · Updated 2 years ago
- End to end data engineering project ☆58 · Oct 27, 2022 · Updated 3 years ago
- ☆14 · Apr 9, 2024 · Updated last year
- Code for blog at https://www.startdataengineering.com/post/python-for-de/ ☆103 · Jun 7, 2024 · Updated last year
- Cost Efficient Data Pipelines with DuckDB ☆63 · May 14, 2025 · Updated 10 months ago
- Project for "Data pipeline design patterns" blog. ☆51 · Aug 6, 2024 · Updated last year
- This repository hosts materials for the Docker for Data Engineers workshop, offering hands-on exercises and resources tailored for data e… ☆17 · May 23, 2024 · Updated last year
- Ecommerce Realtime Data Pipeline (Data Modeling, Workflow Orchestration, Change Data Capture, Analytical Database and Dashboarding) ☆64 · Mar 9, 2024 · Updated 2 years ago
- ☆16 · Jul 15, 2023 · Updated 2 years ago
- Sample project to demonstrate data engineering best practices ☆211 · Feb 24, 2024 · Updated 2 years ago
- Sample repo for startdataengineering DE 101 free course ☆74 · Jun 24, 2024 · Updated last year
- deeplearning.ai on Coursera ☆10 · Mar 4, 2018 · Updated 8 years ago
- ☆12 · Jul 18, 2018 · Updated 7 years ago
- Primary repository for NYC DCP's Data Engineering team ☆35 · Updated this week
- Sample example projects referenced for opensource.com articles ☆11 · Dec 19, 2023 · Updated 2 years ago
- ☆30 · Nov 16, 2023 · Updated 2 years ago
- Final Project for Data Engineering Zoomcamp Course 2024 🧙🔥 ☆11 · Apr 17, 2024 · Updated last year
- A pipeline orchestration tool ☆35 · Aug 2, 2024 · Updated last year
- BitDust user App written in Python using Kivy framework ☆14 · Aug 23, 2025 · Updated 7 months ago
- Some python scripts for beginners, written for the book Automating The Internet with Python ☆13 · Oct 1, 2018 · Updated 7 years ago
- ☆11 · Aug 20, 2024 · Updated last year
- ☆10 · May 24, 2021 · Updated 4 years ago
- A demonstration of an ELT (Extract, Load, Transform) pipeline ☆31 · Feb 19, 2024 · Updated 2 years ago
- In this project I used ML modeling and data analysis to predict ad clicks and significantly improve ad campaign performance, resulting in… ☆12 · Nov 6, 2023 · Updated 2 years ago
- People ask me about data science resources so I've curated some here: this is <<20% of the size of an 'awesome' list but has 80% of the v… ☆11 · Jan 14, 2023 · Updated 3 years ago