kubeflow / spark-operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
☆2,913Updated last week
Alternatives and similar repositories for spark-operator:
Users that are interested in spark-operator are comparing it to the libraries listed below
- [DEPRECATED] Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.☆658Updated 2 years ago
- Apache Flink Kubernetes Operator☆875Updated this week
- Apache YuniKorn Core☆923Updated this week
- Apache Spark enhanced with native Kubernetes scheduler back-end: NOTE this repository is being ARCHIVED as all new development for the ku…☆612Updated 5 years ago
- Kubernetes operator that provides control plane for managing Apache Flink applications☆570Updated 8 months ago
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.☆911Updated 2 weeks ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆2,181Updated last week
- The User-Community Airflow Helm Chart is the standard way to deploy Apache Airflow on Kubernetes with Helm. Originally created in 2017, i…☆690Updated 6 months ago
- Repository holding configuration files for running an HDFS cluster in Kubernetes☆396Updated 7 months ago
- Upserts, Deletes And Incremental Processing on Big Data.☆5,755Updated this week
- An open protocol for secure data sharing☆833Updated this week
- A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC☆1,091Updated last year
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆7,997Updated this week
- Apache Iceberg☆7,334Updated this week
- NVIDIA device plugin for Kubernetes☆3,196Updated this week
- Spark on Kubernetes infrastructure Helm charts repo☆201Updated 2 years ago
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,338Updated this week
- Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.☆947Updated last week
- Apache Ranger - To enable, monitor and manage comprehensive data security across the Hadoop platform and beyond☆948Updated last week
- A Cloud Native Batch System (Project under CNCF)☆4,627Updated this week
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,197Updated this week
- Distributed ML Training and Fine-Tuning on Kubernetes☆1,775Updated this week
- Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.☆213Updated 2 weeks ago
- Cruise-control is the first of its kind to fully automate the dynamic workload rebalance and self-healing of a Kafka cluster. It provides…☆2,840Updated last week
- Kafka cluster as Kubernetes StatefulSet, plain manifests and config☆1,837Updated 11 months ago
- Oh no! Yet another Apache Kafka operator for Kubernetes☆794Updated last month
- Kubeflow Deployment Manifests☆912Updated this week
- Dremio - the missing link in modern data☆1,427Updated last week
- Altinity Kubernetes Operator for ClickHouse creates, configures and manages ClickHouse® clusters running on Kubernetes☆2,106Updated this week
- Kubernetes Cluster Federation☆2,498Updated 2 years ago