databand-ai / awesome-apache-airflow
Curated list of resources about Apache Airflow
☆19 Updated 4 years ago
Alternatives and similar repositories for awesome-apache-airflow
Users who are interested in awesome-apache-airflow are comparing it to the repositories listed below.
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR ☆38 Updated 6 months ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes ☆64 Updated 3 years ago
- Spark runtime on AWS Lambda ☆109 Updated last week
- Auto-generated Diagrams from Airflow DAGs. 🔮 🪄 ☆348 Updated last week
- Example code for running Spark and Hive jobs on EMR Serverless. ☆167 Updated 8 months ago
- Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested da… ☆112 Updated last year
- ☆73 Updated last year
- A CLI to manage and monitor permissions in AWS Lake Formation ☆26 Updated 2 years ago
- Automated data quality suggestions and analysis with Deequ on AWS Glue ☆87 Updated 2 years ago
- 📆 Run, schedule, and manage your dbt jobs using Kubernetes. ☆25 Updated 7 years ago
- Sample code to collect Apache Iceberg metrics for table monitoring ☆28 Updated last year
- This repository contains the dbt-glue adapter ☆131 Updated this week
- Amazon Managed Workflows for Apache Airflow (MWAA) Examples repository contains example DAGs, requirements.txt, plugins, and CloudFormati… ☆115 Updated last month
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR ☆67 Updated 3 years ago
- Project files for the post: Running PySpark Applications on Amazon EMR: Methods for Interacting with PySpark on Amazon Elastic MapReduce. ☆38 Updated 3 years ago
- A Streamlit app that provides insights on your Snowflake account usage. ☆61 Updated 7 months ago
- Build DataOps platform with Apache Airflow and dbt on AWS ☆59 Updated 4 years ago
- A bunch of hacks developed around dbt ☆48 Updated 5 years ago
- Sample Airflow DAGs ☆63 Updated 2 years ago
- Utility functions for dbt projects running on Spark ☆33 Updated 6 months ago
- Fast iterative local development and testing of Apache Airflow workflows ☆201 Updated 3 weeks ago
- rb_status_plugin : Data confidence tool for Airflow ☆12 Updated 2 years ago
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats ☆30 Updated 2 years ago
- The open source version of the Amazon Redshift Cluster Management Guide. ☆48 Updated 2 years ago
- Enforce Best Practices for all your Airflow DAGs. ⭐ ☆104 Updated last week
- Benchmark data warehouses under Fivetran-like conditions ☆170 Updated 2 years ago
- ☆201 Updated last year
- Jupyter notebooks and AWS CloudFormation template to show how Hudi, Iceberg, and Delta Lake work ☆47 Updated 3 years ago
- Sample configuration to deploy a modern data platform. ☆88 Updated 3 years ago
- 🐋 Docker image for AWS Glue Spark/Python ☆23 Updated 2 years ago