Full stack data engineering tools and infrastructure set-up
☆58Feb 13, 2021Updated 5 years ago
Alternatives and similar repositories for data-engineering-devops
Users that are interested in data-engineering-devops are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Practical Data Engineering: A Hands-On Real-Estate Project Guide☆800Mar 10, 2026Updated 2 months ago
- dlt-dagster-demo☆13Nov 6, 2023Updated 2 years ago
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆25Mar 24, 2026Updated last month
- A curated list of dagster code snippets for data engineers☆56Feb 26, 2024Updated 2 years ago
- Data Engineering Project: Extracting music video metrics of Twice using YouTube API, AWS, and Tableau☆32Nov 21, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!☆26Nov 8, 2022Updated 3 years ago
- This is the final project that after participated the Data Engineering Zoomcamp☆11Apr 4, 2022Updated 4 years ago
- Project based learning for Data Engineering fundamentals.☆13Jan 15, 2021Updated 5 years ago
- Data Observability for Data Engineering, published by Packt Publishing☆11Jan 24, 2025Updated last year
- dagster scikit-learn pipeline example.☆46Mar 18, 2023Updated 3 years ago
- Example repo to create end to end tests for data pipeline.☆25Jun 14, 2024Updated last year
- ☆13Oct 4, 2023Updated 2 years ago
- My first attempt at a rough ETL pipeline; technologies include spark, GCS, prefect orchestration, and terraform☆14Oct 12, 2022Updated 3 years ago
- A simple Data Engineering solution for testing or education purposes. You only need to know SQL and Python to understand this project. Da…☆29Jul 2, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆15Oct 10, 2025Updated 7 months ago
- Code Repository for my 3rd Data Project.☆16Jun 13, 2023Updated 2 years ago
- Local development environment for python data projects, with Docker☆23Dec 14, 2022Updated 3 years ago
- Udacity Data Engineering Nano Degree Project, Data Modeling for fact and dimension tables, and ETL pipeline that transfers data from file…☆10Dec 12, 2020Updated 5 years ago
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset☆264Apr 5, 2026Updated last month
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆31May 11, 2026Updated last week
- end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck☆236Updated this week
- Workshop on Machine Learning in Python☆19Mar 24, 2016Updated 10 years ago
- ☆22Jul 14, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Apache Airflow advanced functionalities examples☆21Mar 22, 2024Updated 2 years ago
- This repo contains my projects from the Udacity Data Engineering Nano degree☆13Apr 26, 2023Updated 3 years ago
- This repository contains the official implementation of the research paper: "Towards Training Large-Scale Pathology Foundation Models: fr…☆38Jan 17, 2025Updated last year
- ☆10Jan 4, 2019Updated 7 years ago
- The Open-Source Enterprise Data Platform in a single Portal☆266Updated this week
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol.☆21Jun 12, 2024Updated last year
- ☆14Aug 29, 2025Updated 8 months ago
- Analytics engineering with dbt - projects and developer environment☆22Sep 27, 2024Updated last year
- Solution to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift,…☆57Oct 20, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Apache Hive Metastore as a Standalone server in Docker☆80Aug 22, 2024Updated last year
- Project utilising data from the Age of Empires api at 'https://aoestats.io'☆54Dec 8, 2024Updated last year
- Data Pipeline from the Global Historical Climatology Network DataSet☆27Dec 20, 2022Updated 3 years ago
- Bigdata on Kubernetes, Published by Packt☆36Oct 1, 2024Updated last year
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆29Apr 12, 2023Updated 3 years ago
- ☆10Sep 26, 2023Updated 2 years ago
- This repo will guide you step-by-step method to create star schema dimensional model.☆25Jun 1, 2021Updated 4 years ago