databand-ai / awesome-apache-airflow
Curated list of resources about Apache Airflow
☆19 Updated 4 years ago
Alternatives and similar repositories for awesome-apache-airflow
Users who are interested in awesome-apache-airflow are comparing it to the repositories listed below.
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR ☆39 Updated 11 months ago
- Docker image for AWS Glue Spark/Python ☆23 Updated 2 years ago
- A CLI to manage and monitor permissions in AWS Lake Formation ☆25 Updated 2 years ago
- Automated data quality suggestions and analysis with Deequ on AWS Glue ☆90 Updated 3 years ago
- Auto-generated Diagrams from Airflow DAGs. ☆354 Updated this week
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes ☆63 Updated 3 years ago
- Run, schedule, and manage your dbt jobs using Kubernetes. ☆25 Updated 7 years ago
- Spark runtime on AWS Lambda ☆113 Updated 4 months ago
- Superglue is a lineage-tracking tool built to help visualize the propagation of data through complex pipelines composed of tables, jobs … ☆160 Updated 3 years ago
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats ☆31 Updated 2 years ago
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR ☆67 Updated 4 years ago
- Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested da… ☆113 Updated 2 years ago
- Faker for Snowflake! ☆33 Updated 3 years ago
- Example code for running Spark and Hive jobs on EMR Serverless. ☆168 Updated last year
- Utility functions for dbt projects running on Spark ☆34 Updated last month
- The open source version of the Amazon Redshift Cluster Management Guide. ☆48 Updated 2 years ago
- ☆201 Updated 2 years ago
- A bunch of hacks developed around dbt ☆48 Updated 6 years ago
- Pylint plugin for static code analysis on Airflow code ☆96 Updated 5 years ago
- Benchmark data warehouses under Fivetran-like conditions ☆172 Updated 3 years ago
- Schema modelling framework for decentralised domain-driven ownership of data. ☆260 Updated 2 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow. ☆168 Updated 2 years ago
- Sample configuration to deploy a modern data platform. ☆89 Updated 4 years ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, .... ☆78 Updated this week
- The Picnic Data Vault framework. ☆130 Updated last week
- Build DataOps platform with Apache Airflow and dbt on AWS ☆59 Updated 4 years ago
- This repository contains the dbt-glue adapter ☆139 Updated 2 weeks ago
- Amazon Managed Workflows for Apache Airflow (MWAA) Examples repository contains example DAGs, requirements.txt, plugins, and CloudFormati… ☆118 Updated last month
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database ☆76 Updated 4 years ago
- ☆100 Updated 2 years ago