databand-ai / awesome-apache-airflow
Curated list of resources about Apache Airflow
☆19 · Updated 4 years ago
Alternatives and similar repositories for awesome-apache-airflow
Users who are interested in awesome-apache-airflow are comparing it to the libraries listed below.
- Run, schedule, and manage your dbt jobs using Kubernetes. ☆25 · Updated 7 years ago
- Auto-generated Diagrams from Airflow DAGs. ☆351 · Updated this week
- A CLI to manage and monitor permissions in AWS Lake Formation ☆25 · Updated 2 years ago
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR ☆39 · Updated 8 months ago
- Automated data quality suggestions and analysis with Deequ on AWS Glue ☆88 · Updated 2 years ago
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/) which creates a concu… ☆77 · Updated 7 years ago
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR ☆67 · Updated 3 years ago
- Docker image for AWS Glue Spark/Python ☆23 · Updated 2 years ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes ☆64 · Updated 3 years ago
- ☆97 · Updated 2 years ago
- Example code for running Spark and Hive jobs on EMR Serverless. ☆168 · Updated 9 months ago
- Fast iterative local development and testing of Apache Airflow workflows ☆201 · Updated 2 months ago
- Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested da… ☆113 · Updated 2 years ago
- ☆73 · Updated last year
- ☆23 · Updated last year
- ☆202 · Updated 2 years ago
- Benchmark data warehouses under Fivetran-like conditions ☆171 · Updated 2 years ago
- Schema modelling framework for decentralised domain-driven ownership of data. ☆259 · Updated last year
- dbt (data build tool) projects targeting AWS analytics services (Redshift, Glue, EMR, Athena) and open table formats ☆30 · Updated 2 years ago
- rb_status_plugin: Data confidence tool for Airflow ☆12 · Updated 2 years ago
- Pylint plugin for static code analysis on Airflow code ☆96 · Updated 5 years ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi… ☆96 · Updated last month
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio. ☆52 · Updated 2 years ago
- Faker for Snowflake! ☆33 · Updated 2 years ago
- Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR ☆17 · Updated 6 months ago
- Spark runtime on AWS Lambda ☆111 · Updated 2 months ago
- Spark ETL example processing New York taxi rides public dataset on EKS ☆44 · Updated 2 years ago
- The open source version of the Amazon Redshift Cluster Management Guide. ☆48 · Updated 2 years ago
- Sample Airflow DAGs ☆63 · Updated 2 years ago
- Project files for the post: Running PySpark Applications on Amazon EMR using Apache Airflow: Using the new Amazon Managed Workflows for A… ☆41 · Updated 3 years ago