kubeflow / spark-operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
☆2,878Updated this week
Alternatives and similar repositories for spark-operator:
Users that are interested in spark-operator are comparing it to the libraries listed below
- [DEPRECATED] Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.☆658Updated 2 years ago
- Apache YuniKorn Core☆895Updated this week
- Apache Flink Kubernetes Operator☆847Updated last week
- Apache Spark enhanced with native Kubernetes scheduler back-end: NOTE this repository is being ARCHIVED as all new development for the ku…☆612Updated 5 years ago
- Kubernetes operator that provides control plane for managing Apache Flink applications☆570Updated 6 months ago
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.☆904Updated 3 months ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆2,154Updated this week
- A Cloud Native Batch System (Project under CNCF)☆4,481Updated this week
- Repository holding configuration files for running an HDFS cluster in Kubernetes☆397Updated 5 months ago
- Spark on Kubernetes infrastructure Helm charts repo☆198Updated 2 years ago
- Apache Iceberg☆6,996Updated this week
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark☆1,354Updated last year
- Distributed ML Training and Fine-Tuning on Kubernetes☆1,705Updated this week
- A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC☆1,086Updated last year
- Upserts, Deletes And Incremental Processing on Big Data.☆5,679Updated this week
- Kubeflow Deployment Manifests☆863Updated this week
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆734Updated last month
- The Internals of Apache Spark☆1,492Updated 5 months ago
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆7,858Updated this week
- Apache Ranger - To enable, monitor and manage comprehensive data security across the Hadoop platform and beyond☆934Updated this week
- Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.☆934Updated this week
- Kubernetes custom controller and CRDs to managing Airflow☆299Updated 4 years ago
- Oh no! Yet another Apache Kafka operator for Kubernetes☆791Updated 6 months ago
- Jupyter magics and kernels for working with remote Spark clusters☆1,344Updated last week
- Kafka cluster as Kubernetes StatefulSet, plain manifests and config☆1,838Updated 9 months ago
- The User-Community Airflow Helm Chart is the standard way to deploy Apache Airflow on Kubernetes with Helm. Originally created in 2017, i…☆680Updated 4 months ago
- A toolkit to run Ray applications on Kubernetes☆1,544Updated this week
- Spark RAPIDS plugin - accelerate Apache Spark with GPUs☆870Updated this week
- Descheduler for Kubernetes☆4,699Updated this week
- Dremio - the missing link in modern data☆1,415Updated 4 months ago