apple / batch-processing-gateway
The gateway component to make Spark on K8s much easier for Spark users.
☆184Updated 6 months ago
Alternatives and similar repositories for batch-processing-gateway:
Users that are interested in batch-processing-gateway are comparing it to the libraries listed below
- Apache Spark Kubernetes Operator☆83Updated 2 weeks ago
- Remote shuffle service for Apache Spark to store shuffle data on remote servers.☆327Updated last year
- Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.☆201Updated this week
- This project provides fully automated one-click experience to create Cloud and Kubernetes environment to run Data Analytics workload like…☆53Updated 2 years ago
- Performance optimization for Spark running on Kubernetes☆85Updated 4 years ago
- Helm charts for Trino and Trino Gateway☆156Updated this week
- Apache Iceberg Documentation Site☆41Updated 11 months ago
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A…☆118Updated last month
- DynoYARN is a framework to run simulated YARN clusters and workloads for YARN scale testing.☆58Updated last year
- ☆175Updated this week
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆298Updated last year
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆62Updated last month
- A load balancer / proxy / gateway for prestodb☆357Updated 5 months ago
- The Internals of Spark on Kubernetes☆70Updated 2 years ago
- Kubernetes operator that provides control plane for managing Apache Flink applications☆569Updated 4 months ago
- A Kubernetes Scheduler Extender to provide gang scheduling support for Spark on Kubernetes☆176Updated last year
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆345Updated 7 months ago
- Repository of helm charts for deploying DataHub on a Kubernetes cluster☆168Updated this week
- Operator for Apache Spark-on-Kubernetes for Stackable Data Platform☆59Updated this week
- Setup for running Trino with Hive Metastore on Kubernetes☆99Updated 2 years ago
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆70Updated this week
- LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as De…☆72Updated this week
- A S3 Shuffle plugin for Apache Spark to enable elastic scaling for generic Spark workloads.☆38Updated 8 months ago
- A playground to experience Gravitino☆39Updated last week
- Trino Connector for Apache Paimon.☆31Updated last month
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆135Updated last year
- ☆52Updated this week
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆215Updated this week