kubeflow / spark-operatorLinks
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
☆3,089Updated last week
Alternatives and similar repositories for spark-operator
Users that are interested in spark-operator are comparing it to the libraries listed below
Sorting:
- [DEPRECATED] Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.☆657Updated 3 years ago
- Apache Flink Kubernetes Operator☆974Updated last week
- Apache YuniKorn Core☆996Updated this week
- Kubernetes operator that provides control plane for managing Apache Flink applications☆582Updated 5 months ago
- Spark on Kubernetes infrastructure Helm charts repo☆203Updated 3 years ago
- The User-Community Airflow Helm Chart is the standard way to deploy Apache Airflow on Kubernetes with Helm. Originally created in 2017, i…☆707Updated last year
- Repository holding configuration files for running an HDFS cluster in Kubernetes☆398Updated last year
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.☆939Updated last week
- Apache Spark enhanced with native Kubernetes scheduler back-end: NOTE this repository is being ARCHIVED as all new development for the ku…☆613Updated 6 years ago
- Apache Spark docker image☆2,061Updated 2 years ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆2,290Updated this week
- Kafka cluster as Kubernetes StatefulSet, plain manifests and config☆1,838Updated last year
- Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.☆223Updated last month
- Druid Kubernetes Operator☆208Updated last year
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆808Updated 2 months ago
- Apache Spark Kubernetes Operator☆250Updated this week
- Apache Ranger - To enable, monitor and manage comprehensive data security across the Hadoop platform and beyond☆1,020Updated this week
- Altinity Kubernetes Operator for ClickHouse creates, configures and manages ClickHouse® clusters running on Kubernetes☆2,388Updated this week
- An Open Standard for lineage metadata collection☆2,255Updated this week
- Curated Big Data Applications for Kubernetes☆106Updated 2 years ago
- Cruise-control is the first of its kind to fully automate the dynamic workload rebalance and self-healing of a Kafka cluster. It provides…☆2,977Updated 2 months ago
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,395Updated this week
- Oh no! Yet another Apache Kafka operator for Kubernetes☆791Updated 10 months ago
- Collect, aggregate, and visualize a data ecosystem's metadata☆2,093Updated last week
- Distributed AI Model Training and Fine-Tuning on Kubernetes☆2,003Updated this week
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆8,530Updated this week
- A Kubernetes Scheduler Extender to provide gang scheduling support for Spark on Kubernetes☆176Updated 2 years ago
- ☆40Updated 5 years ago
- Bigtop is an Apache Foundation project for Infrastructure Engineers and Data Scientists looking for comprehensive packaging, testing, and…☆666Updated 2 months ago
- A resource tracking a number of Operators out in the wild.☆3,524Updated 4 years ago