Code for Data Pipelines with Apache Airflow
☆823Aug 15, 2024Updated last year
Alternatives and similar repositories for data-pipelines-with-apache-airflow
Users that are interested in data-pipelines-with-apache-airflow are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Curated list of resources about Apache Airflow☆3,922May 7, 2026Updated last month
- Example end to end data engineering project.☆1,414Dec 8, 2022Updated 3 years ago
- ETL best practices with airflow, with examples☆1,353Sep 25, 2024Updated last year
- This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as staging…☆104Aug 11, 2019Updated 6 years ago
- Beginner data engineering project - batch edition☆582Apr 13, 2026Updated 2 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Personal Data Engineering Projects☆1,021Feb 8, 2023Updated 3 years ago
- Apache Airflow tutorial☆976Nov 3, 2022Updated 3 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆168Oct 24, 2023Updated 2 years ago
- Apache Airflow integration for dbt☆415May 17, 2024Updated 2 years ago
- The easiest way to run Airflow locally, with linting & tests for valid DAGs and Plugins.☆256Jun 25, 2021Updated 5 years ago
- Construct Apache Airflow DAGs Declaratively via YAML configuration files☆1,440Jun 26, 2026Updated last week
- A series of DAGs/Workflows to help maintain the operation of Airflow☆1,770Jun 18, 2024Updated 2 years ago
- Docker with Airflow and Spark standalone cluster☆265Aug 5, 2023Updated 2 years ago
- Great Expectations Airflow operator☆174Apr 1, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Docker Apache Airflow☆3,807Mar 1, 2023Updated 3 years ago
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…☆89Nov 22, 2021Updated 4 years ago
- Apache Airflow advanced functionalities examples☆21Mar 22, 2024Updated 2 years ago
- An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.☆1,519Mar 9, 2020Updated 6 years ago
- Execution of DBT models using Apache Airflow through Docker Compose☆132Jan 3, 2023Updated 3 years ago
- Solution to all projects of Udacity's Data Engineering Nanodegree: Data Modeling with Postgres & Cassandra, Data Warehouse with Redshift,…☆57Oct 20, 2022Updated 3 years ago
- A repository of sample code to accompany our blog post on Airflow and dbt.☆186Aug 23, 2023Updated 2 years ago
- Airflow basics tutorial☆397Sep 1, 2021Updated 4 years ago
- Example DAGs using hooks and operators from Airflow Plugins☆349Jul 24, 2018Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A list of useful resources to learn Data Engineering from scratch☆3,996Jun 19, 2024Updated 2 years ago
- Sample project to demonstrate data engineering best practices☆220Feb 24, 2024Updated 2 years ago
- Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake developme…☆1,948Aug 26, 2022Updated 3 years ago
- Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Jo…☆42,961Jun 10, 2026Updated 3 weeks ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆167Jun 16, 2020Updated 6 years ago
- Data Engineering with Python, published by Packt☆817Jan 30, 2023Updated 3 years ago
- Data Engineering with AWS, Published by Packt☆343Apr 22, 2026Updated 2 months ago
- Code repository for the "PySpark in Action" book☆216Jun 11, 2025Updated last year
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflows☆46,011Updated this week
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Educational project on how to build an ETL (Extract, Transform, Load) data pipeline, orchestrated with Airflow.☆350Jan 12, 2022Updated 4 years ago
- Airflow Unit Tests and Integration Tests☆263Nov 16, 2022Updated 3 years ago
- Curated list of resources about Apache Airflow☆19Apr 7, 2021Updated 5 years ago
- Code for dbt tutorial☆181Jun 4, 2026Updated last month
- Making DAG construction easier☆285Jun 22, 2026Updated last week
- The User-Community Airflow Helm Chart is the standard way to deploy Apache Airflow on Kubernetes with Helm. Originally created in 2017, i…☆708Oct 15, 2024Updated last year
- This repository provides a command line interface (CLI) utility that replicates an Amazon Managed Workflows for Apache Airflow (MWAA) env…☆806Oct 3, 2025Updated 9 months ago