Apache Spark Kubernetes Operator
☆263Updated this week
Alternatives and similar repositories for spark-kubernetes-operator
Users who are interested in spark-kubernetes-operator are comparing it to the libraries listed below.
- The gateway component to make Spark on K8s much easier for Spark users.☆215Dec 16, 2025Updated 2 months ago
- Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.☆3,106Updated this week
- Uniffle is a high performance, general purpose Remote Shuffle Service.☆446Feb 12, 2026Updated 2 weeks ago
- Helm chart for Lakekeeper - a Rust Native Iceberg REST Catalog☆23Updated this week
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆17Jan 4, 2026Updated last month
- Apache DataFusion Comet Spark Accelerator☆1,148Updated this week
- Functional programming for Java. Enhanced switch or simple pattern matching supported; String Interpolation supported; Java Functional In…☆11Jan 15, 2026Updated last month
- Drop-in replacement for Apache Spark UI☆413Feb 17, 2026Updated last week
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,516Updated this week
- Oxia Java client SDK☆19Jan 29, 2026Updated last month
- Framework and tooling to support writing dynamic admission controllers and conversion hooks for Kubernetes in Java☆27Feb 13, 2026Updated 2 weeks ago
- Apache Flink Kubernetes Operator☆989Updated this week
- World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.☆2,882Feb 21, 2026Updated last week
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆2,304Updated this week
- CLI tool to bulk migrate tables from one catalog to another without a data copy☆83Apr 12, 2025Updated 10 months ago
- Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.☆1,111Feb 20, 2026Updated last week
- ☆243Updated this week
- Kafka Connector for Iceberg tables☆16Jul 24, 2023Updated 2 years ago
- Trino monitoring with JMX metrics through Prometheus and Grafana☆17Aug 14, 2024Updated last year
- Open Control Plane for Tables in Data Lakehouse☆380Updated this week
- Docker environment to stream data from Kafka to Iceberg tables☆30Feb 27, 2024Updated 2 years ago
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,420Updated this week
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow running custom code on the executors as they are in…☆94May 9, 2025Updated 9 months ago
- Mock streaming data generator☆17May 31, 2024Updated last year
- Helm Charts for RisingWave☆24Feb 9, 2026Updated 2 weeks ago
- Apache YuniKorn Core☆1,002Updated this week
- The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query process…☆1,715Updated this week
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architecture☆146Jan 21, 2026Updated last month
- Community Java bindings for https://github.com/facebookincubator/velox☆40Updated this week
- RocksDB state storage implementation for Structured Streaming.☆17Oct 21, 2020Updated 5 years ago
- Helm charts for Trino and Trino Gateway☆193Updated this week
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆816Updated this week
- HashCats Auto Clicker is a versatile tool that enhances your gaming experience by automating various actions within the HashCats game☆18Updated this week
- Very simple serializer built for my purposes. Currently has built-in support for MessagePack and BSON.☆12May 2, 2020Updated 5 years ago
- Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processin…☆1,161Feb 20, 2026Updated last week
- Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch …☆3,188Feb 20, 2026Updated last week
- A Kubernetes Scheduler Extender to provide gang scheduling support for Spark on Kubernetes☆177Apr 23, 2023Updated 2 years ago
- Sparglim✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier!☆42Jan 19, 2026Updated last month
- Stackable Operator for Apache Kafka☆27Feb 20, 2026Updated last week