Some example projects for Data Engineers to build, end-to-end.
☆39Nov 8, 2023Updated 2 years ago
Alternatives and similar repositories for DataEngineeringProjects
Users that are interested in DataEngineeringProjects are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data pipeline to build a data warehouse on Postgres☆15Aug 11, 2024Updated last year
- Sample Project to Learn Data Engineering☆10Aug 1, 2021Updated 4 years ago
- This is the end to end MLOps project I built through participated the MLOps Zoomcamp☆10Sep 11, 2022Updated 3 years ago
- A repo to track data engineering projects☆13Nov 11, 2022Updated 3 years ago
- A Data Engineering project. Repository for backend infrastructure and Streamlit app files for a Premier League Dashboard.☆255Dec 19, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Data Engineering Practice Problems☆2,673Jan 8, 2025Updated last year
- This is a collecton of Amazon CDK projects to show how to directly ingest streaming data from Amazon Mananged Service for Apache Kafka (M…☆16Sep 10, 2024Updated last year
- End to end data engineering project☆59Oct 27, 2022Updated 3 years ago
- how to unit test your PySpark code☆29Mar 26, 2021Updated 5 years ago
- ☆20Jan 23, 2023Updated 3 years ago
- Data Pipeline that utilizes GCP, Python 3.10, Prefect, and more.☆10Jan 23, 2023Updated 3 years ago
- CalData's MDSA project with Caltrans on Performance Measurement System (PeMS) data☆12Aug 19, 2025Updated 9 months ago
- Databricks CI/CD using Azure DevOps☆21Nov 1, 2022Updated 3 years ago
- Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from…☆37Jan 23, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Data engineering project using UK Bus Open Data Service (BODS) to calculate late buses in real-time for any selected region in England. P…☆31Apr 2, 2023Updated 3 years ago
- ☆14Aug 6, 2023Updated 2 years ago
- Simulate SDTM datasets in SAS.☆12Jan 19, 2018Updated 8 years ago
- A complete pipeline to pull data from Scryfall's "Magic: The Gathering"-API, via Prefect orchestration and dbt transformation.☆43Apr 27, 2023Updated 3 years ago
- Obsidian plugin to apply spaced repetition to incrementally develop your notes.☆22Dec 19, 2025Updated 5 months ago
- Welcome to my data engineering projects repository! Here you will find a collection of data engineering projects that I have worked on.☆24Apr 27, 2023Updated 3 years ago
- ☆16Apr 1, 2024Updated 2 years ago
- ☆21Mar 11, 2025Updated last year
- ☆12Sep 24, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆14Jan 30, 2023Updated 3 years ago
- Simplify Big Data Analytics with Amazon EMR, published by Packt☆13Jan 18, 2023Updated 3 years ago
- Coding with ChatGPT and other LLMs, published by Packt☆16Dec 9, 2024Updated last year
- A simple script for backing up your favorite YouTube channels.☆12Jan 27, 2024Updated 2 years ago
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce☆15May 22, 2023Updated 2 years ago
- Data warehouse tech stack with PostgreSQL, DBT and Airflow☆20Dec 29, 2025Updated 4 months ago
- A curated list of awesome SQLMesh resources☆38Apr 30, 2025Updated last year
- This Guidance helps customers design a resilient batch process application using AWS services☆19Mar 1, 2026Updated 2 months ago
- LLM Building Blocks for Python Course☆17Nov 17, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Clinical trial data analytic recipes in R for SAS users☆28Sep 3, 2024Updated last year
- Modern Data Engineering Project☆12Jun 3, 2022Updated 3 years ago
- A simple python tool to reverse search whois by name and email from free services☆19May 5, 2020Updated 6 years ago
- An experiment, a playground, a sandbox, a toy — LLMs judging code.☆10Jan 28, 2025Updated last year
- Comprehensive Python client for the Uniprot REST API☆56Oct 6, 2025Updated 7 months ago
- ☆19Mar 27, 2020Updated 6 years ago
- ☆16Jul 29, 2021Updated 4 years ago