apache / spark-kubernetes-operator
Apache Spark Kubernetes Operator
☆85Updated 3 weeks ago
Alternatives and similar repositories for spark-kubernetes-operator:
Users who are interested in spark-kubernetes-operator are comparing it to the libraries listed below.
- The gateway component to make Spark on K8s much easier for Spark users.☆184Updated 7 months ago
- Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.☆204Updated this week
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A…☆118Updated last week
- Helm charts for Trino and Trino Gateway☆157Updated last week
- CLI tool to bulk migrate the tables from one catalog to another without a data copy☆73Updated this week
- The Internals of Spark on Kubernetes☆70Updated 2 years ago
- Trino Connector for Apache Paimon.☆31Updated last month
- Framework for running macro benchmarks in a clustered environment☆32Updated this week
- ☆40Updated last year
- Performance optimization for Spark running on Kubernetes☆85Updated 4 years ago
- This project provides fully automated one-click experience to create Cloud and Kubernetes environment to run Data Analytics workload like…☆53Updated 2 years ago
- Remote shuffle service for Apache Spark to store shuffle data on remote servers.☆327Updated last year
- ☆176Updated this week
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow running custom code on the executors as they are in…☆86Updated 9 months ago
- Storage connector for Trino☆103Updated last week
- An S3 Shuffle plugin for Apache Spark to enable elastic scaling for generic Spark workloads.☆38Updated 8 months ago
- Official Dockerfile for Apache Spark☆121Updated last month
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆97Updated last year
- A re-implementation of Hadoop DistCP in Apache Spark☆47Updated last year
- Apache Iceberg Documentation Site☆41Updated 11 months ago
- Sparglim✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier!☆37Updated 3 months ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆13Updated last month
- Apache Flink☆141Updated last month
- Spline agent for Apache Spark☆190Updated 3 weeks ago
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆299Updated last year
- An Extensible Data Skipping Framework☆43Updated 2 weeks ago
- Instructions for getting started with Ververica Platform on minikube.☆90Updated last week
- Setup for running Trino with Hive Metastore on Kubernetes☆99Updated 2 years ago
- ☆52Updated this week
- Benchmarks for Apache Flink☆173Updated this week