The gateway component to make Spark on K8s much easier for Spark users.
☆216 · Dec 16, 2025 · Updated 3 months ago
Alternatives and similar repositories for batch-processing-gateway
Users who are interested in batch-processing-gateway are comparing it to the repositories listed below.
- Apache YuniKorn Release ☆44 · Mar 12, 2026 · Updated last week
- Apache Spark Kubernetes Operator ☆267 · Updated this week
- Remote shuffle service for Apache Spark to store shuffle data on remote servers. ☆335 · Sep 29, 2023 · Updated 2 years ago
- Community Java bindings for https://github.com/facebookincubator/velox ☆40 · Updated this week
- Apache YuniKorn Core ☆1,004 · Updated this week
- Apache YuniKorn Web UI ☆40 · Updated this week
- Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes. ☆3,109 · Mar 12, 2026 · Updated last week
- Uniffle is a high performance, general purpose Remote Shuffle Service. ☆446 · Mar 5, 2026 · Updated 2 weeks ago
- Apache DataFusion Comet Spark Accelerator ☆1,153 · Mar 13, 2026 · Updated last week
- This project provides fully automated one-click experience to create Cloud and Kubernetes environment to run Data Analytics workload like… ☆56 · Jan 2, 2023 · Updated 3 years ago
- Apache Flink Kubernetes Operator ☆995 · Mar 2, 2026 · Updated 2 weeks ago
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines. ☆1,530 · Updated this week
- Apache YuniKorn K8shim ☆163 · Updated this week
- Client libraries of end users of Apache Kyuubi ☆11 · Jan 10, 2023 · Updated 3 years ago
- Apache Celeborn is an elastic and high-performance service for shuffle and spilled data. ☆1,039 · Mar 11, 2026 · Updated last week
- Apache Yunikorn website - see the master branch for instructions ☆30 · Mar 12, 2026 · Updated last week
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses. ☆16 · Jan 4, 2026 · Updated 2 months ago
- This project provides a reverse proxy for Spark UI on Kubernetes ☆17 · Oct 12, 2023 · Updated 2 years ago
- Apache YuniKorn Scheduler Interface ☆34 · Mar 12, 2026 · Updated last week
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses. ☆2,310 · Updated this week
- Performance optimization for Spark running on Kubernetes ☆87 · Aug 18, 2020 · Updated 5 years ago
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A… ☆134 · Mar 13, 2026 · Updated last week
- A S3 Shuffle plugin for Apache Spark to enable elastic scaling for generic Spark workloads. ☆52 · Sep 17, 2025 · Updated 6 months ago
- A Kubernetes Scheduler Extender to provide gang scheduling support for Spark on Kubernetes ☆177 · Apr 23, 2023 · Updated 2 years ago
- Apache Iceberg Documentation Site ☆42 · Feb 5, 2024 · Updated 2 years ago
- Remote Shuffle Service for Flink ☆191 · Jan 6, 2023 · Updated 3 years ago
- Apache Spark - A unified analytics engine for large-scale data processing ☆16 · Jul 24, 2023 · Updated 2 years ago
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages. ☆892 · Mar 10, 2026 · Updated last week
- ☆18 · Nov 4, 2024 · Updated last year
- This repository contains the development code for sparkMeasure, an Apache Spark performance analysis and troubleshooting library. It simp… ☆816 · Mar 4, 2026 · Updated 2 weeks ago
- Spark RAPIDS plugin - accelerate Apache Spark with GPUs ☆967 · Updated this week
- Benchmarks for queries over continuous data streams. ☆376 · Dec 26, 2025 · Updated 2 months ago
- Apache Iceberg ☆8,636 · Updated this week
- A toolset for writing Kubernetes controllers, or operators, in Java. ☆20 · Dec 14, 2022 · Updated 3 years ago
- Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu… ☆257 · Apr 7, 2023 · Updated 2 years ago
- The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query process… ☆1,730 · Updated this week
- Visualize column-level data lineage in Spark SQL ☆92 · May 13, 2022 · Updated 3 years ago
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations. ☆257 · Feb 21, 2023 · Updated 3 years ago
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino ☆22 · May 30, 2022 · Updated 3 years ago