dsaidgovsg / airflow-pipeline
An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR
☆174Updated last year
Alternatives and similar repositories for airflow-pipeline:
Users that are interested in airflow-pipeline are comparing it to the libraries listed below
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 5 years ago
- ☆198Updated last year
- A curated list of all the awesome examples, articles, tutorials and videos for Apache Airflow.☆96Updated 4 years ago
- Airflow Unit Tests and Integration Tests☆256Updated 2 years ago
- Example DAGs using hooks and operators from Airflow Plugins☆336Updated 6 years ago
- A process that runs in unison with Apache Airflow to control the Scheduler process to ensure High Availability☆233Updated 2 years ago
- A Getting Started Guide for developing and using Airflow Plugins☆93Updated 6 years ago
- A guide to running Airflow on Kubernetes☆172Updated 5 years ago
- A plugin for Apache Airflow that exposes rest end points for the Command Line Interfaces☆325Updated 4 years ago
- Airflow Backfill UI based plugin for existing / new Airflow environment☆65Updated 4 years ago
- Astronomer Core Docker Images☆106Updated 9 months ago
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform☆261Updated last year
- Data ingestion library for Amundsen to build graph and search index☆205Updated last year
- Spark package for checking data quality☆221Updated 5 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- Fast iterative local development and testing of Apache Airflow workflows☆196Updated 3 months ago
- Use Airflow to move data from multiple MySQL databases to BigQuery☆100Updated 4 years ago
- Apache Airflow in Docker Compose (for both versions 1.10.* and 2.*)☆183Updated last year
- Pylint plugin for static code analysis on Airflow code☆93Updated 4 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆97Updated 2 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆167Updated last year
- Create HTML profiling reports from Apache Spark DataFrames☆195Updated 5 years ago
- Airflow training for the crunch conf☆105Updated 6 years ago
- Repository with examples and smoke tests for the GCP Airflow operators and hooks☆147Updated 8 years ago
- Spark on Kubernetes infrastructure Helm charts repo☆198Updated 2 years ago
- REST-like API exposing Airflow data and operations☆61Updated 6 years ago
- Airflow workflow management platform chef cookbook.☆71Updated 5 years ago
- A repository of sample code to accompany our blog post on Airflow and dbt.☆170Updated last year
- Airflow support for Marquez☆32Updated 4 years ago
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/ ) which creates a concu…☆76Updated 6 years ago