brunocfnba / Kubernetes-Airflow
Setup Apache Airflow on Kubernetes
☆10Updated 6 years ago
Alternatives and similar repositories for Kubernetes-Airflow:
Users that are interested in Kubernetes-Airflow are comparing it to the libraries listed below
- Guide on how to setup Apache Airflow containers using Docker and IBM Bluemix☆11Updated 7 years ago
- A pyspark lib to validate data quality☆18Updated 2 years ago
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- Quickly get a kubernetes executor airflow environment provisioned on GKE. Azure Kubernetes Service instructions included also as are inst…☆36Updated 4 years ago
- Export Airflow metrics (from mysql) in prometheus format☆29Updated last month
- Composable filesystem hooks and operators for Apache Airflow.☆17Updated 3 years ago
- Presto Trino with Apache Hive Postgres metastore☆40Updated 6 months ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆47Updated 2 months ago
- REST-like API exposing Airflow data and operations☆61Updated 6 years ago
- Pylint plugin for static code analysis on Airflow code☆93Updated 4 years ago
- Airflow on Kubernetes Operator☆89Updated 2 years ago
- Example Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSub☆37Updated 7 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆97Updated 2 years ago
- ☆20Updated 3 years ago
- A repository of sample code to show data quality checking best practices using Airflow.☆74Updated 2 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆18Updated 8 years ago
- ☆11Updated 5 years ago
- Airflow code accompanying blog post.☆21Updated 6 years ago
- Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and…☆28Updated 2 years ago
- 🐋 Docker image for AWS Glue Spark/Python☆23Updated last year
- ☕⛵WIP PySpark dependency management☆22Updated 6 years ago
- Code to be contributed to the Apache Airflow (incubating) project for ETL workflow management for integrating with the Snowflake Data War…☆25Updated 7 years ago
- PySpark data-pipeline testing and CICD☆28Updated 4 years ago
- ☆24Updated 4 years ago
- Deploy your Spark Production Cluster on Kubernetes☆47Updated 4 years ago
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR☆174Updated last year
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- Repository for makeinga a GitHub Actions for deploying to Kubeflow.☆35Updated 3 years ago
- A working airflow-on-k8s deployment for demos☆8Updated 6 years ago
- Creates simple data models on Snowflake to report dbt source freshness and tests☆24Updated last year