Apache Spark Kubernetes Operator
☆289Jun 23, 2026Updated last week
Alternatives and similar repositories for spark-kubernetes-operator
Users that are interested in spark-kubernetes-operator are comparing it to the libraries listed below.
- The gateway component to make Spark on K8s much easier for Spark users.☆220May 6, 2026Updated last month
- Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.☆3,128Updated this week
- Official Dockerfile for Apache Spark☆169Jun 18, 2026Updated last week
- Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.☆1,053Updated this week
- Apache Polaris, the interoperable, open source catalog for Apache Iceberg☆1,982Updated this week
- Uniffle is a high performance, general purpose Remote Shuffle Service.☆449May 27, 2026Updated last month
- Helm chart for Lakekeeper - a Rust Native Iceberg REST Catalog☆25Updated this week
- Apache DataFusion Comet Spark Accelerator☆1,218Updated this week
- Docker image that builds a patched Apache Spark with AWS Glue support as metastore☆18Jun 8, 2024Updated 2 years ago
- A Kubernetes Operator for Lakekeeper (WIP)☆17Apr 30, 2026Updated 2 months ago
- Drop-in replacement for Apache Spark UI☆472Jun 2, 2026Updated 3 weeks ago
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,575Updated this week
- Functional programming for Java. Enhanced switch or simple pattern matching supported; String Interpolation supported; Java Functional In…☆12Apr 3, 2026Updated 2 months ago
- Apache Flink Kubernetes Operator☆1,013Jun 22, 2026Updated last week
- Open, Multi-modal Catalog for Data & AI☆3,436Jun 17, 2026Updated last week
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆2,348Updated this week
- World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.☆3,031Updated this week
- Apache YuniKorn Release☆45Jun 19, 2026Updated last week
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow running custom code on the executors as they are in…☆96May 11, 2026Updated last month
- Rocksdb state storage implementation for Structured Streaming.☆17Oct 21, 2020Updated 5 years ago
- The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query process…☆1,772Updated this week
- Trino monitoring with JMX metrics through Prometheus and Grafana☆17Aug 14, 2024Updated last year
- Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.☆1,145Updated this week
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,471Updated this week
- Helm charts for Trino and Trino Gateway☆195Jun 22, 2026Updated last week
- Performance optimization for Spark running on Kubernetes☆87Aug 18, 2020Updated 5 years ago
- Spark RAPIDS plugin - accelerate Apache Spark with GPUs☆986Updated this week
- Repository for Practical Data Pipeline Code☆11Feb 19, 2022Updated 4 years ago
- Apache YuniKorn Core☆1,016Jun 22, 2026Updated last week
- Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch …☆3,313Updated this week
- Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.☆1,369Updated this week
- CLI tool to bulk migrate the tables from one catalog to another without a data copy☆85Apr 12, 2025Updated last year
- This repository contains the development code for sparkMeasure, an Apache Spark performance analysis and troubleshooting library. It simp…☆826May 19, 2026Updated last month
- The observability platform for Iceberg lakehouses.☆466Jan 12, 2026Updated 5 months ago
- Presto Helm Charts☆17Oct 28, 2025Updated 8 months ago
- Java SDK for building Kubernetes Operators☆932Updated this week
- Mock streaming data generator☆18May 31, 2024Updated 2 years ago
- ☆54Jun 18, 2026Updated last week
- MCP Server and CLI for Apache Spark History Server. Debug Spark applications from AI agents, scripts, or the terminal.☆178Updated this week