kubeflow / spark-operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
☆2,842Updated last week
Alternatives and similar repositories for spark-operator:
Users that are interested in spark-operator are comparing it to the libraries listed below
- [DEPRECATED] Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.☆657Updated 2 years ago
- Apache Spark enhanced with native Kubernetes scheduler back-end: NOTE this repository is being ARCHIVED as all new development for the ku…☆612Updated 5 years ago
- Apache Flink Kubernetes Operator☆829Updated this week
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.☆898Updated 2 months ago
- Kubernetes operator that provides control plane for managing Apache Flink applications☆569Updated 4 months ago
- Apache YuniKorn Core☆882Updated this week
- The User-Community Airflow Helm Chart is the standard way to deploy Apache Airflow on Kubernetes with Helm. Originally created in 2017, i…☆671Updated 3 months ago
- Repository holding configuration files for running an HDFS cluster in Kubernetes☆397Updated 3 months ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆2,130Updated this week
- A batch scheduler of kubernetes for high performance workload, e.g. AI/ML, BigData, HPC☆1,083Updated last year
- Spark on Kubernetes infrastructure Helm charts repo☆200Updated 2 years ago
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆7,755Updated this week
- Upserts, Deletes And Incremental Processing on Big Data.☆5,555Updated this week
- The Internals of Apache Spark☆1,487Updated 4 months ago
- A Cloud Native Batch System (Project under CNCF)☆4,361Updated this week
- Oh no! Yet another Apache Kafka operator for Kubernetes☆788Updated 5 months ago
- Apache Kafka® running on Kubernetes☆4,950Updated this week
- Bigtop is an Apache Foundation project for Infrastructure Engineers and Data Scientists looking for comprehensive packaging, testing, and…☆625Updated last week
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆719Updated 5 months ago
- Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.☆201Updated this week
- Apache Iceberg☆6,767Updated this week
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,249Updated this week
- Jupyter magics and kernels for working with remote Spark clusters☆1,338Updated 3 weeks ago
- Kubernetes-native Job Queueing☆1,558Updated this week
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,008Updated 2 years ago
- Spark RAPIDS plugin - accelerate Apache Spark with GPUs☆848Updated this week
- Kafka cluster as Kubernetes StatefulSet, plain manifests and config☆1,838Updated 7 months ago
- Kubernetes Cluster Federation☆2,505Updated last year