kubeflow / spark-operatorLinks
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
☆2,952Updated this week
Alternatives and similar repositories for spark-operator
Users that are interested in spark-operator are comparing it to the libraries listed below
Sorting:
- Apache YuniKorn Core☆940Updated this week
- [DEPRECATED] Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.☆657Updated 2 years ago
- Apache Spark enhanced with native Kubernetes scheduler back-end: NOTE this repository is being ARCHIVED as all new development for the ku…☆612Updated 5 years ago
- Apache Flink Kubernetes Operator☆898Updated last week
- A Cloud Native Batch System (Project under CNCF)☆4,760Updated this week
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.☆913Updated last week
- Repository holding configuration files for running an HDFS cluster in Kubernetes☆396Updated 8 months ago
- Kubernetes operator that provides control plane for managing Apache Flink applications☆572Updated 9 months ago
- The User-Community Airflow Helm Chart is the standard way to deploy Apache Airflow on Kubernetes with Helm. Originally created in 2017, i…☆695Updated 8 months ago
- Spark on Kubernetes infrastructure Helm charts repo☆203Updated 2 years ago
- Elastic Cloud on Kubernetes☆2,727Updated this week
- A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC☆1,091Updated 2 years ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆2,206Updated this week
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆764Updated 2 weeks ago
- Kafka cluster as Kubernetes StatefulSet, plain manifests and config☆1,839Updated last year
- Kubernetes Cluster Federation☆2,492Updated 2 years ago
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆8,088Updated this week
- Oh no! Yet another Apache Kafka operator for Kubernetes☆792Updated 3 months ago
- Druid Kubernetes Operator☆205Updated last year
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.☆3,433Updated last week
- Add-on agent to generate and expose cluster-level metrics.☆5,757Updated this week
- Event-driven Automation Framework for Kubernetes☆2,499Updated this week
- A resource tracking a number of Operators out in the wild.☆3,525Updated 3 years ago
- Base classes to use when writing tests with Spark☆1,535Updated 5 months ago
- Distributed ML Training and Fine-Tuning on Kubernetes☆1,817Updated this week
- Kubeflow Deployment Manifests☆922Updated this week
- Apache Spark to Apache Cassandra connector☆1,947Updated last month
- Apache Iceberg☆7,615Updated this week
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,372Updated this week
- Kubebuilder - SDK for building Kubernetes APIs using CRDs☆8,506Updated this week