This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as staging the data, filling the data warehouse, and running checks on the data quality as the final step. Automate the ETL pipeline and creation of data warehouse using Apache Airflow. Skills include: Using Airflow to …
☆104Aug 11, 2019Updated 6 years ago
Alternatives and similar repositories for Data-Pipelines-with-Airflow
Users that are interested in Data-Pipelines-with-Airflow are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Creating Data Pipelines with Apache Airflow to manage ETL from Amazon S3 into Amazon Redshift☆14Jun 12, 2019Updated 7 years ago
- ☆16Dec 4, 2017Updated 8 years ago
- Skeleton project for Apache Airflow training participants to work on.☆17Apr 13, 2026Updated 2 months ago
- Code for Data Pipelines with Apache Airflow☆822Aug 15, 2024Updated last year
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…☆89Nov 22, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆21Jan 13, 2024Updated 2 years ago
- Beginner data engineering project - batch edition☆581Apr 13, 2026Updated 2 months ago
- Educational project on how to build an ETL (Extract, Transform, Load) data pipeline, orchestrated with Airflow.☆349Jan 12, 2022Updated 4 years ago
- Using Apache Airflow to author, run and monitor complex data pipelines.☆12Oct 24, 2018Updated 7 years ago
- Example end to end data engineering project.☆1,411Dec 8, 2022Updated 3 years ago
- Airflow Examples: code samples for Medium articles☆14Jan 10, 2021Updated 5 years ago
- For this project I am creating an ETL (Extract, Transform, and Load) pipeline using Python, RegEx, and SQL Database. The goal is to retri…☆26Feb 9, 2021Updated 5 years ago
- A Data Engineering Project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, Dbt, Polars, Doc…☆24Nov 19, 2024Updated last year
- A demonstration of an ELT (Extract, Load, Transform) pipeline☆31Feb 19, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Apache Airflow advanced functionalities examples☆21Mar 22, 2024Updated 2 years ago
- Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMR☆92Jul 17, 2019Updated 6 years ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆167Jun 16, 2020Updated 6 years ago
- A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract d…☆24Nov 22, 2021Updated 4 years ago
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…☆270Jan 1, 2023Updated 3 years ago
- This is my Apache Airflow Local development setup on Windows 10 WSL2/Mac using docker-compose. It will also include some sample DAGs and …☆33Feb 9, 2024Updated 2 years ago
- Example of an ETL Pipeline using Airflow☆39Aug 30, 2017Updated 8 years ago
- ETL best practices with airflow, with examples☆1,354Sep 25, 2024Updated last year
- This project focuses on time series forecasting to predict store sales for Corporation Favorita, a large Ecuadorian-based grocery retaile…☆19Dec 4, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Role-Based Access Control System for Python implements ANSI INCITS 359☆15Aug 29, 2025Updated 9 months ago
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆114Jan 8, 2026Updated 5 months ago
- This repo is for the Linkedin Learning course: End-to-End Data Engineering Project☆33Nov 9, 2023Updated 2 years ago
- Divulgando programação em Julia para brasileiros.☆10Jan 28, 2022Updated 4 years ago
- A small repository of projects built in my course, REST APIs with Flask and Python.☆22Dec 19, 2017Updated 8 years ago
- Personal Data Engineering Projects☆1,017Feb 8, 2023Updated 3 years ago
- ☆31Jul 7, 2023Updated 2 years ago
- streaming eight subreddits from reddit api using kafka producer & spark structured streaming.☆19Jun 2, 2026Updated 2 weeks ago
- A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!☆880Apr 16, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Built a Data Pipeline for a Retail store using AWS services that collects data from its transactional database (OLTP) in Snowflake and tr…☆13May 25, 2023Updated 3 years ago
- Project - Data Processing and Analysis in Python Course☆39Oct 10, 2018Updated 7 years ago
- Apache Airflow tutorial☆976Nov 3, 2022Updated 3 years ago
- An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.☆1,513Mar 9, 2020Updated 6 years ago
- In this project I used apache airflow to scrape website periodically. This is for the tutorials I do on youtube.☆10Nov 21, 2022Updated 3 years ago
- ☆12Jan 20, 2023Updated 3 years ago
- Desarrollé un proyecto de ETL sobre archivos de diferentes orígenes (CSV, JSON). Luego, utilicé FastAPI para crear una API que permita re…☆10Dec 9, 2022Updated 3 years ago