databand-ai / awesome-apache-airflow
Curated list of resources about Apache Airflow
☆19 · Updated 4 years ago
Alternatives and similar repositories for awesome-apache-airflow
Users who are interested in awesome-apache-airflow are comparing it to the repositories listed below.
- Run, schedule, and manage your dbt jobs using Kubernetes. ☆24 · Updated 6 years ago
- Utility functions for dbt projects running on Spark ☆33 · Updated 5 months ago
- Weekly Data Engineering Newsletter ☆96 · Updated last year
- A best practices guide for using AWS EMR. The guide will cover best practices on the topics of cost, performance, security, operational e… ☆107 · Updated 3 weeks ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark DataFrames ☆64 · Updated 3 years ago
- Automated data quality suggestions and analysis with Deequ on AWS Glue ☆86 · Updated 2 years ago
- Rules based grant management for Snowflake ☆40 · Updated 6 years ago
- A bunch of hacks developed around dbt ☆48 · Updated 5 years ago
- Sample configuration to deploy a modern data platform. ☆88 · Updated 3 years ago
- Full stack data engineering tools and infrastructure set-up ☆53 · Updated 4 years ago
- Delta Lake Documentation ☆49 · Updated last year
- Demo for GitHub Universe 2022 ☆12 · Updated 2 years ago
- Amazon Managed Workflows for Apache Airflow (MWAA) Examples repository contains example DAGs, requirements.txt, plugins, and CloudFormati… ☆116 · Updated last week
- locopy: Loading/Unloading to Redshift and Snowflake using Python. ☆110 · Updated this week
- Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR ☆18 · Updated 2 months ago
- A Streamlit app that provides insights on your Snowflake account usage. ☆60 · Updated 5 months ago
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR ☆38 · Updated 5 months ago
- Example code for running Spark and Hive jobs on EMR Serverless. ☆166 · Updated 6 months ago
- PySpark data-pipeline testing and CICD ☆28 · Updated 4 years ago
- A curated list of dagster code snippets for data engineers ☆56 · Updated last year
- The open source version of the Amazon Redshift Cluster Management Guide. ☆48 · Updated 2 years ago
- Auto-generated Diagrams from Airflow DAGs. ☆344 · Updated this week
- Cloned by the `dbt init` task ☆60 · Updated last year
- This repository contains the dbt-glue adapter ☆127 · Updated last week
- A table-format-agnostic data sharing framework ☆38 · Updated last year
- Pylint plugin for static code analysis on Airflow code ☆95 · Updated 4 years ago
- dbt (data build tool) projects targeting AWS analytics services (Redshift, Glue, EMR, Athena) and open table formats ☆29 · Updated 2 years ago
- Fast iterative local development and testing of Apache Airflow workflows ☆202 · Updated 2 months ago
- A curated list of resources about Snowflake ☆239 · Updated last year
- Command-line interface to quickly generate fake CSV and JSON data ☆73 · Updated last year