Some example projects for Data Engineers to build, end-to-end.
☆39Nov 8, 2023Updated 2 years ago
Alternatives and similar repositories for DataEngineeringProjects
Users that are interested in DataEngineeringProjects are comparing it to the libraries listed below.
- Sample Project to Learn Data Engineering☆10Aug 1, 2021Updated 4 years ago
- This is the end-to-end MLOps project I built while participating in the MLOps Zoomcamp☆10Sep 11, 2022Updated 3 years ago
- A Data Engineering project. Repository for backend infrastructure and Streamlit app files for a Premier League Dashboard.☆255Dec 19, 2025Updated 4 months ago
- End-to-end data platform leveraging the Modern data stack☆52Apr 10, 2024Updated 2 years ago
- Data Engineering Practice Problems☆2,653Jan 8, 2025Updated last year
- This is a collection of Amazon CDK projects to show how to directly ingest streaming data from Amazon Managed Service for Apache Kafka (M…☆16Sep 10, 2024Updated last year
- End to end data engineering project☆58Oct 27, 2022Updated 3 years ago
- How to unit test your PySpark code☆29Mar 26, 2021Updated 5 years ago
- Built a stream processing data pipeline to get data from disparate systems into a dashboard using Kafka as an intermediary.☆29Aug 14, 2023Updated 2 years ago
- ☆20Jan 23, 2023Updated 3 years ago
- Data Pipeline that utilizes GCP, Python 3.10, Prefect, and more.☆10Jan 23, 2023Updated 3 years ago
- This repo is for LinkedIn Learning course: Advanced RAG Applications with Vector Databases☆29Oct 17, 2024Updated last year
- Hexagonal (ports and adapters) architecture applied to Spark and Python data engineering project☆33Jul 26, 2023Updated 2 years ago
- Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from…☆37Jan 23, 2025Updated last year
- Data engineering project using UK Bus Open Data Service (BODS) to calculate late buses in real-time for any selected region in England. P…☆31Apr 2, 2023Updated 3 years ago
- Data Engineering Project to Extract and Process Solana Reddit Data☆39Feb 3, 2024Updated 2 years ago
- Obsidian plugin to apply spaced repetition to incrementally develop your notes.☆22Dec 19, 2025Updated 4 months ago
- Fundamentals of Apache Flink [video], published by Packt☆12Jan 30, 2023Updated 3 years ago
- ☆16Apr 1, 2024Updated 2 years ago
- AWS IoT Services sample code. It will be used in AWS IoT hands-on/workshops in Japan.☆11Jan 14, 2021Updated 5 years ago
- GitHub Action for use with python package interrogate☆11Nov 12, 2024Updated last year
- A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract d…☆24Nov 22, 2021Updated 4 years ago
- This project showcases how to integrate the world of DevOps, focusing on Continuous Integration (CI) and Continuous Deployment (CD) with …☆14Dec 27, 2023Updated 2 years ago
- This is a custom project for WGU, the original project repo is https://github.com/udacity/nd0821-c2-build-model-workflow-starter☆12Feb 1, 2026Updated 2 months ago
- ☆21May 17, 2025Updated 11 months ago
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce☆15May 22, 2023Updated 2 years ago
- This Guidance helps customers design a resilient batch process application using AWS services☆19Mar 1, 2026Updated last month
- Sample code for building a Python application for Apache Flink on Kinesis Data Analytics.☆14Aug 30, 2023Updated 2 years ago
- LLM Building Blocks for Python Course☆17Nov 17, 2025Updated 5 months ago
- ☆17Nov 7, 2024Updated last year
- Clinical trial data analytic recipes in R for SAS users☆27Sep 3, 2024Updated last year
- A simple python tool to reverse search whois by name and email from free services☆19May 5, 2020Updated 5 years ago
- An experiment, a playground, a sandbox, a toy — LLMs judging code.☆10Jan 28, 2025Updated last year
- Atorus clinical SAS Programming Macros☆31Oct 3, 2025Updated 6 months ago
- ☆11Mar 24, 2021Updated 5 years ago
- Solutions to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift,…☆57Oct 20, 2022Updated 3 years ago
- Courses and projects on Data Camp☆11Jun 28, 2020Updated 5 years ago
- The official Python library for Formulaic☆18Apr 25, 2024Updated 2 years ago
- Repository containing projects and summaries of my studies in the field of Data Engineering.☆54Jan 28, 2026Updated 3 months ago