skhatri / airflow-by-exampleLinks
Bunch of Airflow Configurations and DAGs for Kubernetes, Spark based data-pipelines. Scale inside Kubernetes using spark kubernetes master. Secure it with keycloak
☆23Updated 3 years ago
Alternatives and similar repositories for airflow-by-example
Users that are interested in airflow-by-example are comparing it to the libraries listed below
Sorting:
- A repository of sample code to show data quality checking best practices using Airflow.☆78Updated 2 years ago
- Enforce Best Practices for all your Airflow DAGs. ⭐☆104Updated last week
- Great Expectations Airflow operator☆167Updated last week
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆169Updated 2 years ago
- Pylint plugin for static code analysis on Airflow code☆96Updated 5 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup☆88Updated 4 years ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆223Updated 6 months ago
- Astronomer Core Docker Images☆106Updated last year
- The Python fake data producer for Apache Kafka® is a complete demo app allowing you to quickly produce JSON fake streaming datasets and …☆85Updated last year
- Make simple storing test results and visualisation of these in a BI dashboard☆48Updated last month
- Airflow Providers containing Deferrable Operators & Sensors from Astronomer☆149Updated last week
- Pytest plugin for dbt core☆62Updated 9 months ago
- Delta Lake helper methods. No Spark dependency.☆23Updated last year
- New generation opensource data stack☆74Updated 3 years ago
- Airflow training for the crunch conf☆104Updated 7 years ago
- Creates simple data models on Snowflake to report dbt source freshness and tests☆27Updated 2 years ago
- rb_status_plugin : Data confidence tool for Airflow☆12Updated 2 years ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆90Updated 2 years ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated 2 years ago
- Fast iterative local development and testing of Apache Airflow workflows☆201Updated 2 months ago
- Read Delta tables without any Spark☆47Updated last year
- The shared semantic layer definitions that dbt-core and MetricFlow use.☆87Updated 2 weeks ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆113Updated 2 months ago
- Data-aware orchestration with dagster, dbt, and airbyte☆30Updated 2 years ago
- Making DAG construction easier☆276Updated last month
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆76Updated 4 years ago
- ☆31Updated 2 years ago
- Airflow Unit Tests and Integration Tests☆261Updated 2 years ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆64Updated 3 years ago
- A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions.☆125Updated 9 months ago