Repository for Data Engineering Interview Series
☆38Oct 17, 2024Updated last year
Alternatives and similar repositories for data-engineering-interview-series
Users that are interested in data-engineering-interview-series are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Apr 26, 2024Updated 2 years ago
- Example repo to create end to end tests for data pipeline.☆25Jun 14, 2024Updated last year
- Repo for CDC with debezium blog post☆29Sep 15, 2024Updated last year
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/☆13May 24, 2024Updated 2 years ago
- Code to demonstrate data engineering metadata & logging best practices☆21Mar 12, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Rust And Delta Demo. Explanation and walkthrough on delta-rs☆10Aug 21, 2023Updated 2 years ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆45Jan 4, 2024Updated 2 years ago
- End to end data engineering project☆59Oct 27, 2022Updated 3 years ago
- ☆16Aug 29, 2023Updated 2 years ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆105Jun 7, 2024Updated last year
- Simple stream processing pipeline☆112Jun 17, 2024Updated last year
- Cost Efficient Data Pipelines with DuckDB☆61May 14, 2025Updated last year
- This repository hosts materials for the Docker for Data Engineers workshop, offering hands-on exercises and resources tailored for data e…☆17May 23, 2024Updated 2 years ago
- Near real time ETL to populate a dashboard.☆75Sep 9, 2025Updated 8 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆16Jul 15, 2023Updated 2 years ago
- A project from the ml_ops Zoomcamp (DataTalks) using Semiconductor data☆22Sep 7, 2022Updated 3 years ago
- Sample project to demonstrate data engineering best practices☆219Feb 24, 2024Updated 2 years ago
- Primary repository for NYC DCP's Data Engineering team☆40Updated this week
- Sample example projects referenced for opensource.com articles☆11Dec 19, 2023Updated 2 years ago
- Modern partition manager for PostgreSQL☆17May 18, 2023Updated 3 years ago
- ☆30Nov 16, 2023Updated 2 years ago
- BitDust user App written in Python using Kivy framework☆14Aug 23, 2025Updated 9 months ago
- Final Project for Data Engineering Zoomcamp Course 2024 🧙🔥☆11Apr 17, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆11Aug 20, 2024Updated last year
- In this project I used ML modeling and data analysis to predict ad clicks and significantly improve ad campaign performance, resulting in…☆13Nov 6, 2023Updated 2 years ago
- A demonstration of an ELT (Extract, Load, Transform) pipeline☆31Feb 19, 2024Updated 2 years ago
- This project contains my solution for all the data structures and algorithms on Algo Expert, Hackerrank and Leetcode. This repository is …☆10Jan 24, 2021Updated 5 years ago
- In this project, we have to create a predictive model which allows the company to maximize the profit of the next marketing campaign☆15Oct 18, 2025Updated 7 months ago
- ☆11Jul 13, 2020Updated 5 years ago
- files created in ardan labs golang training☆12Nov 8, 2023Updated 2 years ago
- Forecasting Netflix Customer Retention based on Gaussian Process Regression☆14Jul 22, 2023Updated 2 years ago
- Classify images of different kitchenware items☆11Apr 17, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- An ergonomic, opinionated memory interface for AI agents☆39Dec 18, 2025Updated 5 months ago
- This repository is to show my Data Analytics & Engineering skills, share projects, and track my progress.☆66Jun 25, 2023Updated 2 years ago
- In this tutorial, we have added step-by-step instructions to build your own AI chatbot with ChatGPT API. From setting up tools to install…☆11Apr 13, 2023Updated 3 years ago
- An example project that implements a data pipeline using Scala, Akka, and Spark and works with document-oriented and graph databases to l…☆11Aug 9, 2019Updated 6 years ago
- Beginner data engineering project - batch edition☆583Apr 13, 2026Updated last month
- In this project first we fetch data of any stock(NSE) in realtime then we evaluate the stock price using basics visualizations then we…☆12Mar 24, 2023Updated 3 years ago
- duckdb-etl-framework☆15Dec 20, 2024Updated last year