databand-ai / awesome-apache-airflowLinks
Curated list of resources about Apache Airflow
โ19Updated 4 years ago
Alternatives and similar repositories for awesome-apache-airflow
Users that are interested in awesome-apache-airflow are comparing it to the libraries listed below
Sorting:
- Auto-generated Diagrams from Airflow DAGs. ๐ฎ ๐ชโ354Updated last week
- Automated data quality suggestions and analysis with Deequ on AWS Glueโ89Updated 2 years ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframesโ64Updated 3 years ago
- ๐ Run, schedule, and manage your dbt jobs using Kubernetes.โ25Updated 7 years ago
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMRโ39Updated 9 months ago
- A CLI to manage and monitor permissions in AWS Lake Formationโ25Updated 2 years ago
- Benchmark data warehouses under Fivetran-like conditionsโ171Updated 2 years ago
- Example code for running Spark and Hive jobs on EMR Serverless.โ169Updated 11 months ago
- This repository contains the dbt-glue adapterโ137Updated this week
- Build DataOps platform with Apache Airflow and dbt on AWSโ59Updated 4 years ago
- A best practices guide for using AWS EMR. The guide will cover best practices on the topics of cost, performance, security, operational eโฆโ109Updated 2 months ago
- ๐ Docker image for AWS Glue Spark/Pythonโ23Updated 2 years ago
- โ98Updated 2 years ago
- โ73Updated last year
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formatsโ31Updated 2 years ago
- A Streamlit app that provides insights on your Snowflake account usage.โ62Updated 10 months ago
- A bunch of hacks developed around dbtโ48Updated 6 years ago
- Spark runtime on AWS Lambdaโ113Updated 3 months ago
- Pylint plugin for static code analysis on Airflow codeโ96Updated 5 years ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data piโฆโ97Updated last week
- Amazon Managed Workflows for Apache Airflow (MWAA) Examples repository contains example DAGs, requirements.txt, plugins, and CloudFormati โฆโ117Updated last week
- The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog aโฆโ226Updated 8 months ago
- PySpark data-pipeline testing andย CICDโ28Updated 5 years ago
- DBND is an agile pipeline framework that helps data engineering teams track and orchestrate their data processes.โ267Updated 8 months ago
- Enforce Best Practices for all your Airflow DAGs. โญโ106Updated last week
- A Helm chart to install Apache Airflow on Kubernetesโ290Updated last week
- Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested daโฆโ113Updated 2 years ago
- A curated list of awesome blogs, videos, tools and resources about Data Contractsโ180Updated last year
- Sample configuration to deploy a modern data platform.โ89Updated 3 years ago
- Faker for Snowflake!โ33Updated 2 years ago