Full stack data engineering tools and infrastructure set-up
☆57Feb 13, 2021Updated 5 years ago
Alternatives and similar repositories for data-engineering-devops
Users that are interested in data-engineering-devops are comparing it to the libraries listed below
Sorting:
- dlt-dagster-demo☆13Nov 6, 2023Updated 2 years ago
- Practical Data Engineering: A Hands-On Real-Estate Project Guide☆773Sep 3, 2024Updated last year
- A curated list of dagster code snippets for data engineers☆56Feb 26, 2024Updated 2 years ago
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆25Mar 3, 2024Updated last year
- A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!☆26Nov 8, 2022Updated 3 years ago
- A simple Data Engineering solution for testing or education purposes. You only need to know SQL and Python to understand this project. Da…☆28Jul 2, 2022Updated 3 years ago
- Data Observability for Data Engineering, published by Packt Publishing☆11Jan 24, 2025Updated last year
- This is the final project that after participated the Data Engineering Zoomcamp☆11Apr 4, 2022Updated 3 years ago
- dagster scikit-learn pipeline example.☆46Mar 18, 2023Updated 2 years ago
- Guide to run Visual Studio Code and/or VSCodium on Android and then SSH-ing into a remote server for remote development. No root required…☆36Apr 4, 2025Updated 10 months ago
- ☆13Oct 4, 2023Updated 2 years ago
- ☆14Oct 10, 2025Updated 4 months ago
- Code Repository for my 3rd Data Project.☆16Jun 13, 2023Updated 2 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Feb 18, 2026Updated last week
- My first attempt at a rough ETL pipeline; technologies include spark, GCS, prefect orchestration, and terraform☆14Oct 12, 2022Updated 3 years ago
- ☆20Dec 19, 2023Updated 2 years ago
- Apache Airflow advanced functionalities examples☆21Mar 22, 2024Updated last year
- The Open-Source Enterprise Data Platform in a single Portal☆264Feb 20, 2026Updated last week
- NoGraphs is a library that simplifies the analysis of graphs that can not or should not be fully computed, stored or adapted, e.g., infin…☆25Feb 6, 2026Updated 3 weeks ago
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol.☆21Jun 12, 2024Updated last year
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆91Jun 25, 2023Updated 2 years ago
- Apache Hive Metastore as a Standalone server in Docker☆80Aug 22, 2024Updated last year
- All the code related to building my own data lake☆21May 22, 2023Updated 2 years ago
- Analytics engineering with dbt - projects and developer environment☆22Sep 27, 2024Updated last year
- ☆22Jul 14, 2020Updated 5 years ago
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆27Apr 12, 2023Updated 2 years ago
- Data Pipeline from the Global Historical Climatology Network DataSet☆27Dec 20, 2022Updated 3 years ago
- My notes on the Certified Kubernetes Administrator (CKA) exam and how to prepare.☆22Oct 14, 2019Updated 6 years ago
- This repo will guide you step-by-step method to create star schema dimensional model.☆25Jun 1, 2021Updated 4 years ago
- OpenConext SAML 2.0 IdP/SP Gateway☆17Updated this week
- A project portfolio to accompany my resume☆30Sep 5, 2023Updated 2 years ago
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset☆262Dec 13, 2025Updated 2 months ago
- Demo Project for Open Source MDS☆170Aug 27, 2025Updated 6 months ago
- A tool to generate PySpark schema from JSON.☆28Jan 21, 2024Updated 2 years ago
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats☆31Apr 13, 2023Updated 2 years ago
- A demonstration of an ELT (Extract, Load, Transform) pipeline☆31Feb 19, 2024Updated 2 years ago
- ☆81Updated this week
- Sample project to demonstrate data engineering best practices☆204Feb 24, 2024Updated 2 years ago
- Simple ETL pipeline using Python☆29May 22, 2023Updated 2 years ago