Some example projects for Data Engineers to build, end-to-end.
☆39Nov 8, 2023Updated 2 years ago
Alternatives and similar repositories for DataEngineeringProjects
Users that are interested in DataEngineeringProjects are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data pipeline to build a data warehouse on Postgres☆15Aug 11, 2024Updated last year
- Sample Project to Learn Data Engineering☆10Aug 1, 2021Updated 4 years ago
- A repo to track data engineering projects☆13Nov 11, 2022Updated 3 years ago
- Template for Data Engineering and Data Pipeline projects☆117Jan 1, 2023Updated 3 years ago
- A Data Engineering project. Repository for backend infrastructure and Streamlit app files for a Premier League Dashboard.☆254Dec 19, 2025Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- End-to-end data platform leveraging the Modern data stack☆52Apr 10, 2024Updated last year
- Data Engineering Practice Problems☆2,597Jan 8, 2025Updated last year
- End to end data engineering project☆58Oct 27, 2022Updated 3 years ago
- how to unit test your PySpark code☆29Mar 26, 2021Updated 5 years ago
- ☆19Jan 23, 2023Updated 3 years ago
- Reusable github actions workflows for R packages☆16Mar 30, 2026Updated last week
- A tool for comparing the design and analysis of surveys by simulating spatially-correlated populations☆12Sep 16, 2025Updated 6 months ago
- CalData's MDSA project with Caltrans on Performance Measurement System (PeMS) data☆12Aug 19, 2025Updated 7 months ago
- Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from…☆38Jan 23, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Data engineering project using UK Bus Open Data Service (BODS) to calculate late buses in real-time for any selected region in England. P…☆31Apr 2, 2023Updated 3 years ago
- Data Engineering Project to Extract and Process Solana Reddit Data☆40Feb 3, 2024Updated 2 years ago
- A complete pipeline to pull data from Scryfall's "Magic: The Gathering"-API, via Prefect orchestration and dbt transformation.☆43Apr 27, 2023Updated 2 years ago
- Obsidian plugin to apply spaced repetition to incrementally develop your notes.☆22Dec 19, 2025Updated 3 months ago
- Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …☆17Oct 1, 2019Updated 6 years ago
- This repository is a curated collection of information (keywords, papers, libraries, books, etc.) about counterfactual explanations🙃 Con…☆23Oct 27, 2022Updated 3 years ago
- Welcome to my data engineering projects repository! Here you will find a collection of data engineering projects that I have worked on.☆24Apr 27, 2023Updated 2 years ago
- This repository contains the code snippets used in "LLM Prompt Engineering For Developers"☆12Apr 22, 2024Updated last year
- Learn Spanish conjugation the easy way☆19Aug 16, 2016Updated 9 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆14Jan 30, 2023Updated 3 years ago
- A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract d…☆24Nov 22, 2021Updated 4 years ago
- This project showcases how to integrate the world of DevOps, focusing on Continuous Integration (CI) and Continuous Deployment (CD) with …☆14Dec 27, 2023Updated 2 years ago
- ☆14Jan 3, 2020Updated 6 years ago
- Simplify Big Data Analytics with Amazon EMR, published by Packt☆13Jan 18, 2023Updated 3 years ago
- Coding with ChatGPT and other LLMs, published by Packt☆16Dec 9, 2024Updated last year
- Generate proxy versions of cards/decks you are interested in purchasing!☆16May 18, 2025Updated 10 months ago
- The Godot 4 Beginners card game example☆14Jul 28, 2025Updated 8 months ago
- Medieval strategy autobattling deckbuilder. My First Game Jam: Summer 2020.☆17Apr 28, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A simple script for backing up your favorite YouTube channels.☆12Jan 27, 2024Updated 2 years ago
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce☆15May 22, 2023Updated 2 years ago
- A curated list of awesome SQLMesh resources☆38Apr 30, 2025Updated 11 months ago
- This Guidance helps customers design a resilient batch process application using AWS services☆19Mar 1, 2026Updated last month
- Sample code for building a Python application for Apache Flink on Kinesis Data Analytics.☆14Aug 30, 2023Updated 2 years ago
- An introductory university course designed to equip students with a comprehensive understanding of various cloud computing models, archit…☆25Jul 3, 2023Updated 2 years ago
- Data Engineering Course☆23Jun 4, 2024Updated last year