Full stack data engineering tools and infrastructure set-up
☆58Feb 13, 2021Updated 5 years ago
Alternatives and similar repositories for data-engineering-devops
Users that are interested in data-engineering-devops are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Practical Data Engineering: A Hands-On Real-Estate Project Guide☆806Updated this week
- dlt-dagster-demo☆14Nov 6, 2023Updated 2 years ago
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆26Mar 24, 2026Updated 3 months ago
- A curated list of dagster code snippets for data engineers☆55Feb 26, 2024Updated 2 years ago
- A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!☆27Nov 8, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This is the final project that after participated the Data Engineering Zoomcamp☆11Apr 4, 2022Updated 4 years ago
- Project based learning for Data Engineering fundamentals.☆13Jan 15, 2021Updated 5 years ago
- dagster scikit-learn pipeline example.☆46Mar 18, 2023Updated 3 years ago
- Example repo to create end to end tests for data pipeline.☆25Jun 14, 2024Updated 2 years ago
- ☆10Jun 16, 2026Updated last week
- My first attempt at a rough ETL pipeline; technologies include spark, GCS, prefect orchestration, and terraform☆14Oct 12, 2022Updated 3 years ago
- A simple Data Engineering solution for testing or education purposes. You only need to know SQL and Python to understand this project. Da…☆29Jul 2, 2022Updated 3 years ago
- ☆15Oct 10, 2025Updated 8 months ago
- Code Repository for my 3rd Data Project.☆16Jun 13, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Local development environment for python data projects, with Docker☆23Dec 14, 2022Updated 3 years ago
- Udacity Data Engineering Nano Degree Project, Data Modeling for fact and dimension tables, and ETL pipeline that transfers data from file…☆10Dec 12, 2020Updated 5 years ago
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset☆264Apr 5, 2026Updated 2 months ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆32Updated this week
- end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck☆236May 15, 2026Updated last month
- ☆22Jul 14, 2020Updated 5 years ago
- Apache Airflow advanced functionalities examples☆21Mar 22, 2024Updated 2 years ago
- ☆21Dec 19, 2023Updated 2 years ago
- ☆30Aug 2, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The Open-Source Enterprise Data Platform in a single Portal☆265Updated this week
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol.☆21Jun 12, 2024Updated 2 years ago
- Solution to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift,…☆57Oct 20, 2022Updated 3 years ago
- Analytics engineering with dbt - projects and developer environment☆22Sep 27, 2024Updated last year
- Project utilising data from the Age of Empires api at 'https://aoestats.io'☆53Dec 8, 2024Updated last year
- All the code related to building my own data lake☆21May 22, 2023Updated 3 years ago
- Data Pipeline from the Global Historical Climatology Network DataSet☆27Dec 20, 2022Updated 3 years ago
- ☆10Sep 26, 2023Updated 2 years ago
- This repo will guide you step-by-step method to create star schema dimensional model.☆25Jun 1, 2021Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Utilizando o GitHub para expor seus projetos de Data Science - Materiais☆17Apr 27, 2021Updated 5 years ago
- PII anonymization (MCP + Skill for Claude and CLI for the rest)☆117Jun 15, 2026Updated 2 weeks ago
- Simultating concurrent writes to sqlite3 with multiprocessing and pytest☆22Oct 20, 2021Updated 4 years ago
- Source code of webpro.nl☆11Oct 12, 2025Updated 8 months ago
- ☆32Aug 13, 2018Updated 7 years ago
- Udacity Data Engineering Nanodegree Capstone Project☆36May 9, 2020Updated 6 years ago
- Debian 10 (Buster) based Docker in Docker container for Ansible playbook and role testing.☆10Apr 17, 2021Updated 5 years ago