A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousing, containerization, and a dashboard to monitor data pipeline KPIs
☆15Apr 29, 2021Updated 4 years ago
Alternatives and similar repositories for Data_Engineering_Projects
Users that are interested in Data_Engineering_Projects are comparing it to the libraries listed below
Sorting:
- Acquiring and processing information on world's largest banks☆17Jun 17, 2025Updated 8 months ago
- Capstone Project for the IBM Data Engineering Professional Certification.☆13Mar 7, 2022Updated 3 years ago
- This data project can be used as a take-home assignment to learn Pyspark and Data Engineering.☆17Feb 19, 2023Updated 3 years ago
- ☆14May 14, 2024Updated last year
- capstone project for Dataengineer.io bootcamp Public Repo☆12Feb 20, 2024Updated 2 years ago
- Scan and monitor your network effortlessly! Nmap Prometheus Exporter provides insights into network health and security with Prometheus-c…☆15Oct 2, 2023Updated 2 years ago
- Repository containing example solutions for the Data Engineering Career Path Portfolio Projects☆17Sep 16, 2022Updated 3 years ago
- A repo to track data engineering projects☆13Nov 11, 2022Updated 3 years ago
- Set up a Cost-Effective Modern Data Stack for a Charity☆19Mar 26, 2025Updated 11 months ago
- End-to-end ELT data engineering project☆22Dec 24, 2022Updated 3 years ago
- Code for the Data Engineering Zoomcamp☆20Dec 12, 2022Updated 3 years ago
- 🎩 AI-powered cover letter generator☆25Jul 13, 2025Updated 7 months ago
- Constructed a dashboard with FastAPI that extracts data from the yfinance API to a SQLAlchemy database.☆21Mar 16, 2025Updated 11 months ago
- Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMR☆90Jul 17, 2019Updated 6 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project☆24Jul 14, 2022Updated 3 years ago
- A user-friendly Command & Control (C&C) web platform for remote monitoring, management, and task automation across multiple devices.☆14Dec 15, 2024Updated last year
- A quick starter kit to bootstrap NodeJS/ReactJS app☆26Jun 4, 2020Updated 5 years ago
- A project portfolio to accompany my resume☆30Sep 5, 2023Updated 2 years ago
- Fully dockerized Data Warehouse (DWH) using Airflow, dbt, PostgreSQL and dashboard using redash☆25Nov 12, 2022Updated 3 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆56May 6, 2023Updated 2 years ago
- Course Material Data Engineering on AWS Course☆31Sep 9, 2024Updated last year
- Simple ETL pipeline using Python☆29May 22, 2023Updated 2 years ago
- Telegram bot that manages creation of queues / attendance lists for periodic events.☆15Dec 23, 2024Updated last year
- Built a stream processing data pipeline to get data from disparate systems into a dashboard using Kafka as an intermediary.☆29Aug 14, 2023Updated 2 years ago
- Playable synthesizer created with Tone.js, Next.js, and React.☆10Aug 14, 2022Updated 3 years ago
- This project aims to build a traveling recommendation application using Google Places API and OpenAI LLM.☆11Mar 19, 2024Updated last year
- ☆16Feb 20, 2026Updated last week
- Hexagonal (ports and adapters) architecture applied to Spark and Python data engineering project☆33Jul 26, 2023Updated 2 years ago
- 🥢 The simplest way to create REST API with Node.js, Express.js, and TypeORM.☆11Oct 30, 2025Updated 3 months ago
- A tutoring app solves a real problem for students — to find an affordable and knowledgeable tutor on-demand. The design is based on a tw…☆10Nov 18, 2017Updated 8 years ago
- Tools for diffing and comparing web content. Also includes a web server that makes diffs available as an HTTP service.☆15Updated this week
- My solutions for the Udacity Data Engineering Nanodegree☆34Oct 14, 2019Updated 6 years ago
- Program summarizes news articles into a couple of sentences. This project was inspired by SMMRY, the algorithm used in many subreddits to…☆10Jan 15, 2019Updated 7 years ago
- This web extension allows users to navigate Glassdoor while logged out.☆13Jan 15, 2025Updated last year
- GraphQL API for cloud pricing. Contains over 3M public prices from AWS, Azure and GCP. Self-updates prices via an automated weekly job.☆18Feb 13, 2026Updated 2 weeks ago
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce☆15May 22, 2023Updated 2 years ago
- 💳 ETL (Extract, Transform and Load) pipeline for calculating stats for a transactions database & testing the efficacy of a loyalty prog…☆10Apr 25, 2017Updated 8 years ago
- 🍕🍔🍟 Delimenú es una aplicación web para que los restaurantes puedan digitalizar sus menús y de esta manera sus usuarios puedan sentirs…☆10Nov 19, 2024Updated last year
- This is the end to end MLOps project I built through participated the MLOps Zoomcamp☆10Sep 11, 2022Updated 3 years ago