databand-ai / awesome-apache-airflowLinks
Curated list of resources about Apache Airflow
โ19Updated 4 years ago
Alternatives and similar repositories for awesome-apache-airflow
Users that are interested in awesome-apache-airflow are comparing it to the libraries listed below
Sorting:
- Automated data quality suggestions and analysis with Deequ on AWS Glueโ86Updated 2 years ago
- Auto-generated Diagrams from Airflow DAGs. ๐ฎ ๐ชโ345Updated this week
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMRโ38Updated 5 months ago
- Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMRโ18Updated 3 months ago
- A CLI to manage and monitor permissions in AWS Lake Formationโ26Updated 2 years ago
- โ73Updated last year
- Example code for running Spark and Hive jobs on EMR Serverless.โ167Updated 7 months ago
- Amazon Managed Workflows for Apache Airflow (MWAA) Examples repository contains example DAGs, requirements.txt, plugins, and CloudFormatiโฆโ116Updated last month
- ๐ Run, schedule, and manage your dbt jobs using Kubernetes.โ24Updated 6 years ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframesโ64Updated 3 years ago
- This repository contains the dbt-glue adapterโ128Updated this week
- Spark runtime on AWS Lambdaโ108Updated last week
- Faker for Snowflake!โ33Updated 2 years ago
- ๐ Docker image for AWS Glue Spark/Pythonโ23Updated last year
- Build DataOps platform with Apache Airflow and dbt on AWSโ57Updated 4 years ago
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/ ) which creates a concuโฆโ75Updated 6 years ago
- Utility functions for dbt projects running on Sparkโ33Updated 5 months ago
- Pylint plugin for static code analysis on Airflow codeโ95Updated 4 years ago
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMRโ67Updated 3 years ago
- A best practices guide for using AWS EMR. The guide will cover best practices on the topics of cost, performance, security, operational eโฆโ107Updated last month
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.โ52Updated last year
- Project files for the post: Running PySpark Applications on Amazon EMR using Apache Airflow: Using the new Amazon Managed Workflows for Aโฆโ41Updated 3 years ago
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formatsโ29Updated 2 years ago
- rb_status_plugin : Data confidence tool for Airflowโ12Updated 2 years ago
- The open source version of the Amazon Redshift Cluster Management Guide.โ48Updated 2 years ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python.โ110Updated this week
- Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake workโ47Updated 3 years ago
- Reference Architectures for Datalakes on AWSโ79Updated 5 years ago
- โ62Updated 3 years ago
- A Streamlit app that provides insights on your Snowflake account usage.โ60Updated 6 months ago