apple / batch-processing-gateway
The gateway component to make Spark on K8s much easier for Spark users.
☆184Updated 2 weeks ago
Alternatives and similar repositories for batch-processing-gateway:
Users that are interested in batch-processing-gateway are comparing it to the libraries listed below
- Apache Spark Kubernetes Operator☆93Updated this week
- Helm charts for Trino and Trino Gateway☆158Updated this week
- This project provides fully automated one-click experience to create Cloud and Kubernetes environment to run Data Analytics workload like…☆53Updated 2 years ago
- Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.☆206Updated this week
- Remote shuffle service for Apache Spark to store shuffle data on remote servers.☆327Updated last year
- ☆179Updated this week
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A…☆118Updated 3 weeks ago
- Performance optimization for Spark running on Kubernetes☆86Updated 4 years ago
- The Internals of Spark on Kubernetes☆70Updated 2 years ago
- DynoYARN is a framework to run simulated YARN clusters and workloads for YARN scale testing.☆58Updated last year
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆345Updated 8 months ago
- Apache Iceberg Documentation Site☆42Updated last year
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆135Updated last year
- Repository of helm charts for deploying DataHub on a Kubernetes cluster☆171Updated this week
- A load balancer / proxy / gateway for prestodb☆357Updated 6 months ago
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆62Updated 2 months ago
- ☆40Updated last year
- Operator for Apache Spark-on-Kubernetes for Stackable Data Platform☆59Updated this week
- A Kubernetes Scheduler Extender to provide gang scheduling support for Spark on Kubernetes☆176Updated last year
- Setup for running Trino with Hive Metastore on Kubernetes☆99Updated 2 years ago
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆75Updated this week
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆299Updated last year
- Spline agent for Apache Spark☆191Updated last week
- Multi-hop declarative data pipelines☆109Updated this week
- A S3 Shuffle plugin for Apache Spark to enable elastic scaling for generic Spark workloads.☆38Updated 9 months ago
- pulsar lakehouse connector☆31Updated this week
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆226Updated this week
- Kubernetes (K8s) Operator for PrestoDB☆46Updated 3 years ago
- The Internals of Delta Lake☆183Updated last month
- Management and automation platform for Stateful Distributed Systems☆104Updated this week