kubeflow / spark-operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
☆3,053 Updated this week
Alternatives and similar repositories for spark-operator
Users who are interested in spark-operator are comparing it to the repositories listed below.
- [DEPRECATED] Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications. ☆657 Updated 3 years ago
- Apache YuniKorn Core ☆982 Updated last week
- Apache Flink Kubernetes Operator ☆951 Updated last week
- Kubernetes operator that provides control plane for managing Apache Flink applications ☆580 Updated 3 months ago
- Repository holding configuration files for running an HDFS cluster in Kubernetes ☆398 Updated last year
- Spark on Kubernetes infrastructure Helm charts repo ☆203 Updated 3 years ago
- Apache Spark enhanced with native Kubernetes scheduler back-end: NOTE this repository is being ARCHIVED as all new development for the ku… ☆613 Updated 5 years ago
- The User-Community Airflow Helm Chart is the standard way to deploy Apache Airflow on Kubernetes with Helm. Originally created in 2017, i… ☆704 Updated last year
- A Cloud Native Batch System (Project under CNCF) ☆5,054 Updated this week
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses. ☆2,269 Updated last week
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere. ☆931 Updated this week
- Distributed AI Model Training and Fine-Tuning on Kubernetes ☆1,957 Updated this week
- Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications. ☆220 Updated this week
- An Open Standard for lineage metadata collection ☆2,178 Updated this week
- Kafka cluster as Kubernetes StatefulSet, plain manifests and config ☆1,840 Updated last year
- Oh no! Yet another Apache Kafka operator for Kubernetes ☆790 Updated 8 months ago
- Apache Spark Kubernetes Operator ☆227 Updated this week
- A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC ☆1,092 Updated 2 years ago
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr… ☆8,381 Updated last week
- Collect, aggregate, and visualize a data ecosystem's metadata ☆2,058 Updated last week
- Apache Ranger - To enable, monitor and manage comprehensive data security across the Hadoop platform and beyond