wooga / airconditioner
☆15Updated this week
Related projects: ⓘ
- ☕⛵WIP PySpark dependency management☆22Updated 6 years ago
- REST-like API exposing Airflow data and operations☆61Updated 5 years ago
- Pylint plugin for static code analysis on Airflow code☆89Updated 3 years ago
- ☆21Updated this week
- A collection of airflow sample workflows for data processing on aws☆12Updated 6 years ago
- Google Spreadsheets datasource for SparkSQL and DataFrames☆57Updated last year
- Quickly get a kubernetes executor airflow environment provisioned on GKE. Azure Kubernetes Service instructions included also as are inst…☆36Updated 4 years ago
- Example stream processing job, written in Scala with Apache Beam, for Google Cloud Dataflow☆29Updated 7 years ago
- Airflow workflow management platform chef cookbook.☆67Updated 5 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆18Updated 7 years ago
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform☆262Updated last year
- Visualize dependencies between Airflow DAGs☆49Updated 3 years ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆46Updated 4 years ago
- The sane way of building a data layer in Airflow☆24Updated 4 years ago
- Ephemeral Hadoop clusters using Google Compute Platform☆134Updated 2 years ago
- [ARCHIVED] The Presto adapter plugin for dbt Core☆33Updated 9 months ago
- Astronomer Core Docker Images☆106Updated 3 months ago
- Python API for Deequ☆41Updated 3 years ago
- Python client for Marquez☆12Updated 3 years ago
- A tool and library for easily deploying applications on Apache YARN☆142Updated 6 months ago
- A Python client for Apache Livy, enabling use of remote Apache Spark clusters.☆70Updated 2 years ago
- Export Redshift data and convert to Parquet for use with Redshift Spectrum or other data warehouses.☆116Updated last year
- Accelerator to rapidly deploy customized features for your business☆55Updated 9 months ago
- Airflow plugin to transfer arbitrary files between operators☆78Updated 5 years ago
- CLI tool to launch Spark jobs on AWS EMR☆67Updated 11 months ago
- BigQuery Schema Conversion Tool☆23Updated 3 years ago
- Tools for creating Dataproc custom images☆33Updated last week
- CLI tool for syncing a Databricks folder structure with a local git repo.☆17Updated last month
- A Getting Started Guide for developing and using Airflow Plugins☆94Updated 5 years ago
- An open source library for BigQuery testing.☆14Updated 2 years ago