apple / batch-processing-gateway
The gateway component to make Spark on K8s much easier for Spark users.
☆187Updated 2 months ago
Alternatives and similar repositories for batch-processing-gateway:
Users that are interested in batch-processing-gateway are comparing it to the libraries listed below
- Apache Spark Kubernetes Operator☆106Updated this week
- Remote shuffle service for Apache Spark to store shuffle data on remote servers.☆326Updated last year
- Performance optimization for Spark running on Kubernetes☆87Updated 4 years ago
- Helm charts for Trino and Trino Gateway☆161Updated last week
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A…☆120Updated last week
- DynoYARN is a framework to run simulated YARN clusters and workloads for YARN scale testing.☆59Updated 2 years ago
- This project provides fully automated one-click experience to create Cloud and Kubernetes environment to run Data Analytics workload like…☆55Updated 2 years ago
- Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.☆209Updated this week
- The Internals of Spark on Kubernetes☆70Updated 2 years ago
- A load balancer / proxy / gateway for prestodb☆357Updated 8 months ago
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆345Updated 10 months ago
- Setup for running Trino with Hive Metastore on Kubernetes☆100Updated 2 years ago
- A Kubernetes Scheduler Extender to provide gang scheduling support for Spark on Kubernetes☆176Updated last year
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆61Updated 3 months ago
- CLI tool to bulk migrate tables from one catalog to another without a data copy☆76Updated last month
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆135Updated last year
- Operator for Apache Spark-on-Kubernetes for Stackable Data Platform☆61Updated this week
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆299Updated last year
- Spark metrics related custom classes and sinks (e.g. Prometheus)☆180Updated 2 years ago
- Kubernetes operator that provides control plane for managing Apache Flink applications☆570Updated 7 months ago
- An Extensible Data Skipping Framework☆43Updated 2 months ago
- Storage connector for Trino☆106Updated last week
- A S3 Shuffle plugin for Apache Spark to enable elastic scaling for generic Spark workloads.☆39Updated 10 months ago
- Spark on Kubernetes infrastructure Helm charts repo☆198Updated 2 years ago
- Apache Iceberg Documentation Site☆42Updated last year
- A playground to experience Gravitino☆41Updated last month
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow running custom code on the executors as they are in…☆88Updated 11 months ago
- Multi-hop declarative data pipelines☆112Updated last week