databand-ai / awesome-apache-airflow
Curated list of resources about Apache Airflow
☆19 · Updated 4 years ago
Alternatives and similar repositories for awesome-apache-airflow
Users that are interested in awesome-apache-airflow are comparing it to the libraries listed below
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes ☆63 · Updated 2 years ago
- Utility functions for dbt projects running on Spark ☆33 · Updated 3 months ago
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats ☆29 · Updated 2 years ago
- A bunch of hacks developed around dbt ☆48 · Updated 5 years ago
- 🐋 Docker image for AWS Glue Spark/Python ☆23 · Updated last year
- Delta Lake Documentation ☆49 · Updated 10 months ago
- 📆 Run, schedule, and manage your dbt jobs using Kubernetes. ☆24 · Updated 6 years ago
- Makes it simple to store test results and visualise them in a BI dashboard ☆44 · Updated last month
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR ☆36 · Updated 2 months ago
- Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR ☆18 · Updated 2 weeks ago
- Library to convert DBT manifest metadata to Airflow tasks ☆48 · Updated last year
- re_data - fix data issues before your users & CEO would discover them 😊 ☆98 · Updated last year
- ☆21 · Updated 3 years ago
- a dbt package to make auditing dbt runs easy. ☆100 · Updated 5 months ago
- Oozie Workflow to Airflow DAGs migration tool ☆87 · Updated 2 months ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow. ☆168 · Updated last year
- Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work ☆47 · Updated 2 years ago
- Event-triggered plugins for Airflow ☆21 · Updated 5 years ago
- Pylint plugin for static code analysis on Airflow code ☆94 · Updated 4 years ago
- Sample code to collect Apache Iceberg metrics for table monitoring ☆27 · Updated 8 months ago
- Sample configuration to deploy a modern data platform. ☆88 · Updated 3 years ago
- ☆199 · Updated last year
- Automated data quality suggestions and analysis with Deequ on AWS Glue ☆85 · Updated 2 years ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects ☆83 · Updated last year
- A curated list of dagster code snippets for data engineers ☆55 · Updated last year
- A repository of sample code to show data quality checking best practices using Airflow. ☆77 · Updated 2 years ago
- Run dbt serverless in the Cloud (AWS) ☆42 · Updated 5 years ago
- ☆24 · Updated 5 years ago
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data. ☆79 · Updated this week
- Faker for Snowflake! ☆33 · Updated 2 years ago