The gateway component to make Spark on K8s much easier for Spark users.
☆220May 6, 2026Updated last month
Alternatives and similar repositories for batch-processing-gateway
Users that are interested in batch-processing-gateway are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Apache YuniKorn Release☆45Jun 19, 2026Updated last week
- Apache Spark Kubernetes Operator☆289Jun 23, 2026Updated last week
- Remote shuffle service for Apache Spark to store shuffle data on remote servers.☆335Sep 29, 2023Updated 2 years ago
- Apache YuniKorn Core☆1,016Jun 22, 2026Updated last week
- Apache YuniKorn Web UI☆39Jun 22, 2026Updated last week
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.☆3,128Updated this week
- Uniffle is a high performance, general purpose Remote Shuffle Service.☆449May 27, 2026Updated last month
- Apache DataFusion Comet Spark Accelerator☆1,218Updated this week
- Apache Flink Kubernetes Operator☆1,013Jun 22, 2026Updated last week
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,575Updated this week
- Apache YuniKorn K8shim☆163Jun 22, 2026Updated last week
- Client libraries of end users of Apache Kyuubi☆11May 15, 2026Updated last month
- Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.☆1,053Updated this week
- Apache Yunikorn website - see the master branch for instructions☆30Jun 10, 2026Updated 2 weeks ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆16May 22, 2026Updated last month
- This project provides a reverse proxy for Spark UI on Kubernetes☆16Oct 12, 2023Updated 2 years ago
- Apache YuniKorn Scheduler Interface☆34Jun 22, 2026Updated last week
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆2,348Updated this week
- Performance optimization for Spark running on Kubernetes☆87Aug 18, 2020Updated 5 years ago
- Spark-Dashboard is an open-source monitoring solution for Apache Spark that provides real-time performance dashboards using containers an…☆137May 6, 2026Updated last month
- A S3 Shuffle plugin for Apache Spark to enable elastic scaling for generic Spark workloads.☆52Sep 17, 2025Updated 9 months ago
- A Kubernetes Scheduler Extender to provide gang scheduling support for Spark on Kubernetes☆179Apr 23, 2023Updated 3 years ago
- Apache Iceberg Documentation Site☆42Feb 5, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Remote Shuffle Service for Flink☆190Jan 6, 2023Updated 3 years ago
- Apache Spark - A unified analytics engine for large-scale data processing☆16Jul 24, 2023Updated 2 years ago
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆905Jun 9, 2026Updated 3 weeks ago
- ☆18May 7, 2026Updated last month
- This repository contains the development code for sparkMeasure, an Apache Spark performance analysis and troubleshooting library. It simp…☆826May 19, 2026Updated last month
- Benchmarks for queries over continuous data streams.☆385Dec 26, 2025Updated 6 months ago
- A toolset for writing Kubernetes controllers, or operators, in Java.☆20Dec 14, 2022Updated 3 years ago
- Apache Iceberg☆8,988Updated this week
- Spark RAPIDS plugin - accelerate Apache Spark with GPUs☆986Updated this week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Visualize column-level data lineage in Spark SQL☆92May 13, 2022Updated 4 years ago
- The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query process…☆1,772Updated this week
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.☆255Feb 21, 2023Updated 3 years ago
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino☆22May 30, 2022Updated 4 years ago
- Spark* shuffle plugin for support shuffling data through a remote Hadoop-compatible file system, as opposed to vanilla Spark's local-dis…☆21Mar 15, 2024Updated 2 years ago
- A composable and fully extensible C++ execution engine library for data management systems.☆4,156Updated this week
- A Cloud Native Batch System (Project under CNCF)☆5,714Updated this week