skhatri / airflow-by-exampleLinks
Bunch of Airflow Configurations and DAGs for Kubernetes, Spark based data-pipelines. Scale inside Kubernetes using spark kubernetes master. Secure it with keycloak
☆23Updated 3 years ago
Alternatives and similar repositories for airflow-by-example
Users that are interested in airflow-by-example are comparing it to the libraries listed below
Sorting:
- Astronomer Core Docker Images☆107Updated last year
- A repository of sample code to show data quality checking best practices using Airflow.☆78Updated 2 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup☆88Updated 4 years ago
- Enforce Best Practices for all your Airflow DAGs. ⭐☆104Updated this week
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆169Updated last year
- Great Expectations Airflow operator☆168Updated this week
- Pylint plugin for static code analysis on Airflow code☆95Updated 4 years ago
- Airflow Providers containing Deferrable Operators & Sensors from Astronomer☆149Updated this week
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆218Updated 3 months ago
- Airflow Unit Tests and Integration Tests☆260Updated 2 years ago
- Fast iterative local development and testing of Apache Airflow workflows☆202Updated 3 months ago
- Airflow training for the crunch conf☆105Updated 6 years ago
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆75Updated 3 years ago
- ☆200Updated last year
- ☆48Updated 3 years ago
- A Helm chart to install Apache Airflow on Kubernetes☆286Updated this week
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated last year
- Making DAG construction easier☆270Updated 3 weeks ago
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- Delta Lake helper methods. No Spark dependency.☆23Updated 11 months ago
- The Python fake data producer for Apache Kafka® is a complete demo app allowing you to quickly produce JSON fake streaming datasets and …☆86Updated last year
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆218Updated 2 weeks ago
- triggering a DAG run multiple times☆88Updated last year
- Example DAGs using hooks and operators from Airflow Plugins☆346Updated 7 years ago
- ☆23Updated 4 years ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆64Updated 3 years ago
- Delta Lake examples☆227Updated 10 months ago
- PySpark data-pipeline testing and CICD☆28Updated 4 years ago
- New generation opensource data stack☆70Updated 3 years ago
- This repository is part of an article "Prefect workflow automation with Azure DevOps and AKS"☆30Updated 4 years ago