kubeflow / mcp-apache-spark-history-serverLinks
MCP Server for Apache Spark History Server. The bridge between Agentic AI and Apache Spark.
☆107Updated this week
Alternatives and similar repositories for mcp-apache-spark-history-server
Users that are interested in mcp-apache-spark-history-server are comparing it to the libraries listed below
Sorting:
- Apache Spark Kubernetes Operator☆240Updated 2 weeks ago
- Drop-in replacement for Apache Spark UI☆361Updated this week
- ☆237Updated last week
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆800Updated last month
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A…☆130Updated last month
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆346Updated last year
- Apache DataFusion Comet Spark Accelerator☆1,072Updated this week
- REST API for Apache Spark on K8S or YARN☆108Updated this week
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆83Updated 7 months ago
- A library that provides useful extensions to Apache Spark and PySpark.☆230Updated last week
- Spline agent for Apache Spark☆200Updated last week
- Helm charts for Trino and Trino Gateway☆187Updated 2 weeks ago
- ☆269Updated last year
- Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL).☆20Updated 9 months ago
- Remote shuffle service for Apache Spark to store shuffle data on remote servers.☆336Updated 2 years ago
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,368Updated this week
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆136Updated 2 years ago
- Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.☆1,066Updated this week
- Uniffle is a high performance, general purpose Remote Shuffle Service.☆432Updated this week
- PyIceberg☆945Updated this week
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,482Updated this week
- A Python Library to support running data quality rules while the spark job is running⚡☆193Updated this week
- Sparglim✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier!☆39Updated 3 weeks ago
- The Internals of Spark SQL☆480Updated last week
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆290Updated this week
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆873Updated 2 weeks ago
- Trino Group Provider LDAP is a Trino (formerly Presto SQL) plugin to map user names to groups using an LDAP server☆23Updated last year
- A load balancer / proxy / gateway for prestodb☆357Updated last year
- Official Dockerfile for Apache Spark☆155Updated 2 weeks ago
- Qubole Sparklens tool for performance tuning Apache Spark☆586Updated last year