apple/batch-processing-gateway

The gateway component to make Spark on K8s much easier for Spark users.

☆221

Alternatives and similar repositories for batch-processing-gateway

Users who are interested in batch-processing-gateway are comparing it to the repositories listed below.

apache / yunikorn-release
Apache YuniKorn Release
☆45 · Updated this week
uber / RemoteShuffleService
Remote shuffle service for Apache Spark to store shuffle data on remote servers.
☆335 · Sep 29, 2023 · Updated 2 years ago
apache / spark-kubernetes-operator
Apache Spark Kubernetes Operator
☆301 · Updated this week
boostscale / velox4j
Community Java bindings for https://github.com/facebookincubator/velox
☆43 · Updated this week
apache / yunikorn-web
Apache YuniKorn Web UI
☆39 · Updated this week
apache / yunikorn-core
Apache YuniKorn Core
☆1,018 · Updated this week
kubeflow / spark-operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
☆3,139 · Updated this week
apache / uniffle
Uniffle is a high performance, general purpose Remote Shuffle Service.
☆451 · Updated this week
apache / datafusion-comet
Apache DataFusion Comet Spark Accelerator
☆1,230 · Updated this week
apache / flink-kubernetes-operator
Apache Flink Kubernetes Operator
☆1,021 · Updated this week
apache / kyuubi-client
Client libraries of end users of Apache Kyuubi
☆11 · May 15, 2026 · Updated 2 months ago
apache / gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
☆1,576 · Updated this week
apache / yunikorn-site
Apache Yunikorn website - see the master branch for instructions
☆30 · Jul 9, 2026 · Updated last week
apache / kyuubi-docker
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
☆16 · May 22, 2026 · Updated last month
datapunchorg / spark-ui-reverse-proxy
This project provides a reverse proxy for Spark UI on Kubernetes
☆16 · Oct 12, 2023 · Updated 2 years ago
apache / yunikorn-scheduler-interface
Apache YuniKorn Scheduler Interface
☆34 · Updated this week
cerndb / spark-dashboard
Spark-Dashboard is an open-source monitoring solution for Apache Spark that provides real-time performance dashboards using containers an…
☆137 · May 6, 2026 · Updated 2 months ago
apache / kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
☆2,352 · Updated this week
aws-samples / eks-spark-benchmark
Performance optimization for Spark running on Kubernetes
☆87 · Aug 18, 2020 · Updated 5 years ago
apple / ml-batchquant
☆23 · Oct 12, 2022 · Updated 3 years ago
apple / ml-interspeech2022-phi_rtn
Repository accompanying the Interspeech 2022 publication titled "Space-Efficient Representation of Entity-centric Query Language Models" …
☆13 · Sep 8, 2022 · Updated 3 years ago
apple / ml-vfi-smiff
☆14 · Nov 5, 2025 · Updated 8 months ago
liaco / mimir
☆16 · Jul 25, 2025 · Updated 11 months ago
yaooqinn / spark-history-cli
CLI tool for querying Apache Spark History Server REST API
☆28 · Mar 22, 2026 · Updated 3 months ago
IBM / spark-s3-shuffle
A S3 Shuffle plugin for Apache Spark to enable elastic scaling for generic Spark workloads.
☆52 · Sep 17, 2025 · Updated 10 months ago
apache / iceberg-docs
Apache Iceberg Documentation Site
☆42 · Feb 5, 2024 · Updated 2 years ago
flink-extended / flink-remote-shuffle
Remote Shuffle Service for Flink
☆189 · Jan 6, 2023 · Updated 3 years ago
palantir / k8s-spark-scheduler
A Kubernetes Scheduler Extender to provide gang scheduling support for Spark on Kubernetes
☆179 · Apr 23, 2023 · Updated 3 years ago
kubeflow / mcp-apache-spark-history-server
MCP Server and CLI for Apache Spark History Server. Debug Spark applications from AI agents, scripts, or the terminal.
☆183 · Updated this week
apache / iceberg
Apache Iceberg
☆9,062 · Updated this week
Tencent / Firestorm
Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…
☆256 · Apr 7, 2023 · Updated 3 years ago
LucaCanali / sparkMeasure
This repository contains the development code for sparkMeasure, an Apache Spark performance analysis and troubleshooting library. It simp…
☆827 · May 19, 2026 · Updated 2 months ago
linkedin / coral
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
☆907 · Updated this week
maropu / spark-sql-flow-plugin
Visualize column-level data lineage in Spark SQL
☆92 · May 13, 2022 · Updated 4 years ago
tj--- / iceberg-demo
A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino
☆22 · May 30, 2022 · Updated 4 years ago
oap-project / gazelle_plugin
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
☆255 · Feb 21, 2023 · Updated 3 years ago
oap-project / remote-shuffle
Spark* shuffle plugin for support shuffling data through a remote Hadoop-compatible file system, as opposed to vanilla Spark's local-dis…
☆21 · Mar 15, 2024 · Updated 2 years ago
zuston / riffle
Rust based high-performance Apache Uniffle shuffle-server
☆70 · Updated this week
adobe / lake-pulse
A Rust library for analyzing data lake table health — checking the pulse — across multiple formats (Delta Lake, Apache Iceberg, Apache Hu…
☆20 · Jul 11, 2026 · Updated last week