databand-ai / awesome-apache-airflow
Curated list of resources about Apache Airflow
☆19 · Updated 3 years ago
Alternatives and similar repositories for awesome-apache-airflow:
Users that are interested in awesome-apache-airflow are comparing it to the libraries listed below
- A bunch of hacks developed around dbt ☆48 · Updated 5 years ago
- Run, schedule, and manage your dbt jobs using Kubernetes. ☆24 · Updated 6 years ago
- A CLI to manage and monitor permissions in AWS Lake Formation ☆26 · Updated 2 years ago
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR ☆31 · Updated last month
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats ☆29 · Updated last year
- Docker image for AWS Glue Spark/Python ☆23 · Updated last year
- Faker for Snowflake! ☆33 · Updated 2 years ago
- Sample configuration to deploy a modern data platform. ☆88 · Updated 3 years ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes ☆63 · Updated 2 years ago
- Utility functions for dbt projects running on Spark ☆31 · Updated last month
- Sample code to collect Apache Iceberg metrics for table monitoring ☆25 · Updated 7 months ago
- PySpark data-pipeline testing and CICD ☆28 · Updated 4 years ago
- Full stack data engineering tools and infrastructure set-up ☆50 · Updated 4 years ago
- Demo for GitHub Universe 2022 ☆12 · Updated 2 years ago
- AWS Quick Start Team ☆18 · Updated 5 months ago
- Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested da… ☆111 · Updated last year
- Make simple storing test results and visualisation of these in a BI dashboard ☆42 · Updated last month
- Automated data quality suggestions and analysis with Deequ on AWS Glue ☆84 · Updated 2 years ago
- Project files for the post: Running PySpark Applications on Amazon EMR using Apache Airflow: Using the new Amazon Managed Workflows for A… ☆40 · Updated 2 years ago
- The open source version of the Amazon Redshift Cluster Management Guide. ☆48 · Updated last year
- Delta Lake Documentation ☆49 · Updated 9 months ago
- Build DataOps platform with Apache Airflow and dbt on AWS ☆55 · Updated 3 years ago
- ☆22 · Updated 5 months ago
- Sample Airflow DAGs ☆62 · Updated 2 years ago
- Rules based grant management for Snowflake ☆40 · Updated 6 years ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python ☆42 · Updated 2 years ago
- RStoolKit - A utility to perform a complete health check of your AWS RedShift Cluster ☆23 · Updated 4 years ago
- Oozie Workflow to Airflow DAGs migration tool ☆88 · Updated 2 weeks ago
- ☆20 · Updated 3 years ago
- Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR ☆18 · Updated 7 months ago