kubeflow / spark-operatorLinks
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
☆3,001Updated this week
Alternatives and similar repositories for spark-operator
Users that are interested in spark-operator are comparing it to the libraries listed below
Sorting:
- [DEPRECATED] Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.☆658Updated 2 years ago
- Apache Flink Kubernetes Operator☆922Updated this week
- Apache YuniKorn Core☆966Updated this week
- Kubernetes operator that provides control plane for managing Apache Flink applications☆575Updated 2 weeks ago
- Spark on Kubernetes infrastructure Helm charts repo☆204Updated 2 years ago
- Repository holding configuration files for running an HDFS cluster in Kubernetes☆397Updated 10 months ago
- Apache Spark enhanced with native Kubernetes scheduler back-end: NOTE this repository is being ARCHIVED as all new development for the ku…☆612Updated 5 years ago
- The User-Community Airflow Helm Chart is the standard way to deploy Apache Airflow on Kubernetes with Helm. Originally created in 2017, i…☆696Updated 10 months ago
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.☆923Updated last week
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆2,232Updated last week
- Distributed ML Training and Fine-Tuning on Kubernetes☆1,891Updated this week
- Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.☆215Updated this week
- A toolkit to run Ray applications on Kubernetes☆1,991Updated last week
- Druid Kubernetes Operator☆206Updated last year
- A Cloud Native Batch System (Project under CNCF)☆4,885Updated this week
- Apache Spark docker image☆2,054Updated 2 years ago
- A Kubernetes Scheduler Extender to provide gang scheduling support for Spark on Kubernetes☆176Updated 2 years ago
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆8,215Updated last week
- Kubeflow Deployment Manifests☆936Updated last week
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆784Updated last week
- Curated Big Data Applications for Kubernetes☆103Updated 2 years ago
- Apache Spark Kubernetes Operator☆205Updated last week
- Apache Ranger - To enable, monitor and manage comprehensive data security across the Hadoop platform and beyond☆977Updated this week
- The Internals of Apache Spark☆1,514Updated last month
- Bigtop is an Apache Foundation project for Infrastructure Engineers and Data Scientists looking for comprehensive packaging, testing, and…☆655Updated last week
- A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC☆1,092Updated 2 years ago
- Upserts, Deletes And Incremental Processing on Big Data.☆5,912Updated last week
- Apache Iceberg☆7,855Updated this week
- Oh no! Yet another Apache Kafka operator for Kubernetes☆791Updated 5 months ago
- Altinity Kubernetes Operator for ClickHouse creates, configures and manages ClickHouse® clusters running on Kubernetes☆2,194Updated this week