jghoman / awesome-apache-airflow
Curated list of resources about Apache Airflow
☆3,691 Updated 3 months ago
Related projects
Alternatives and complementary repositories for awesome-apache-airflow
- ETL best practices with Airflow, with examples ☆1,295 Updated last month
- Docker Apache Airflow ☆3,776 Updated last year
- A series of DAGs/Workflows to help maintain the operation of Airflow ☆1,681 Updated 5 months ago
- Dynamically generate Apache Airflow DAGs from YAML configuration files ☆1,209 Updated this week
- Guides and docs to help you get up and running with Apache Airflow. ☆800 Updated 2 years ago
- A curated list of awesome ETL frameworks, libraries, and software. ☆3,287 Updated 3 months ago
- A curated list of awesome Apache Spark packages and resources. ☆1,722 Updated 3 weeks ago
- Code for Data Pipelines with Apache Airflow ☆719 Updated 3 months ago
- A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin ☆6,204 Updated last month
- Apache Airflow tutorial ☆934 Updated 2 years ago
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets. ☆3,309 Updated last month
- A curated list of data engineering tools for software developers ☆462 Updated 7 years ago
- A Docker image and Kubernetes config files to run Airflow on Kubernetes ☆655 Updated 5 years ago
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting… ☆4,440 Updated last week
- Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io ☆1,913 Updated this week
- A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow ☆2,081 Updated 11 months ago
- A curated list of data engineering tools for software developers ☆6,828 Updated 3 weeks ago
- Airflow basics tutorial ☆397 Updated 3 years ago
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application… ☆9,976 Updated this week
- Helm Charts for the Astronomer Platform, Apache Airflow as a Service on Kubernetes ☆465 Updated this week
- Collect, aggregate, and visualize a data ecosystem's metadata ☆1,781 Updated last week
- re_data - fix data issues before your users & CEO would discover them 😊 ☆1,552 Updated 6 months ago
- Utility functions for dbt projects. ☆1,379 Updated last week
- An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform. ☆1,291 Updated 4 years ago
- A plugin for Apache Airflow that allows you to edit DAGs in browser ☆403 Updated 2 weeks ago
- The easiest way to run Airflow locally, with linting & tests for valid DAGs and Plugins. ☆241 Updated 3 years ago
- Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark ☆1,481 Updated this week
- This repository is a getting started guide to Singer. ☆1,272 Updated 2 months ago
- Python API for Deequ ☆730 Updated last month
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflows ☆37,168 Updated this week