brunocfnba / Kubernetes-Airflow
Set up Apache Airflow on Kubernetes
☆9 · Updated 6 years ago
Related projects
Alternatives and complementary repositories for Kubernetes-Airflow
- Guide on how to set up Apache Airflow containers using Docker and IBM Bluemix ☆11 · Updated 6 years ago
- Data validation library for PySpark 3.0.0 ☆34 · Updated 2 years ago
- Export Airflow metrics (from MySQL) in Prometheus format ☆29 · Updated 2 years ago
- Airflow on Kubernetes Operator ☆89 · Updated last year
- The sane way of building a data layer in Airflow ☆24 · Updated 4 years ago
- Paper: A Zero-rename committer for object stores ☆20 · Updated 3 years ago
- ☆54 · Updated 7 years ago
- Deploy your Spark Production Cluster on Kubernetes ☆47 · Updated 4 years ago
- Example Spark applications that run on Kubernetes and access GCP products, e.g., GCS, BigQuery, and Cloud PubSub ☆36 · Updated 6 years ago
- A repository of sample code to show data quality checking best practices using Airflow. ☆72 · Updated last year
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection ☆18 · Updated 7 years ago
- Quickly get a Kubernetes executor Airflow environment provisioned on GKE. Azure Kubernetes Service instructions included also as are inst… ☆36 · Updated 4 years ago
- Apache Airflow CI pipeline ☆18 · Updated 5 years ago
- An example PySpark project with pytest ☆17 · Updated 7 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform ☆47 · Updated 11 months ago
- ☆48 · Updated 2 years ago
- An Airflow Docker image preconfigured to work well with Spark and Hadoop/EMR ☆173 · Updated last year
- A plugin for Apache Airflow that allows you to manage the users that can log in ☆14 · Updated 5 years ago
- ☆11 · Updated 5 years ago
- A K8s-based infrastructure for analytics ☆24 · Updated 4 years ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator ☆75 · Updated 5 years ago
- CLI tool to launch Spark jobs on AWS EMR ☆67 · Updated last year
- Composable filesystem hooks and operators for Apache Airflow. ☆17 · Updated 3 years ago
- Astronomer Core Docker Images ☆106 · Updated 5 months ago
- Base project for creating Python Apache Beam pipelines and running them in Google DataFlow using CRON scheduler ☆23 · Updated 7 years ago
- Spark on Kubernetes ☆105 · Updated last year
- Repository with examples and smoke tests for the GCP Airflow operators and hooks ☆146 · Updated 7 years ago
- Pylint plugin for static code analysis on Airflow code ☆90 · Updated 4 years ago
- Visualize dependencies between Airflow DAGs ☆49 · Updated 3 years ago