kubeflow / mcp-apache-spark-history-serverLinks
MCP Server for Apache Spark History Server. The bridge between Agentic AI and Apache Spark.
☆98Updated 3 weeks ago
Alternatives and similar repositories for mcp-apache-spark-history-server
Users that are interested in mcp-apache-spark-history-server are comparing it to the libraries listed below
Sorting:
- Drop-in replacement for Apache Spark UI☆341Updated 2 weeks ago
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆795Updated last week
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆346Updated last year
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A…☆130Updated last week
- Apache Spark Kubernetes Operator☆227Updated last week
- ☆269Updated last year
- A library that provides useful extensions to Apache Spark and PySpark.☆230Updated last week
- ☆233Updated last week
- Apache DataFusion Comet Spark Accelerator☆1,065Updated this week
- Spline agent for Apache Spark☆199Updated last week
- The Internals of Spark SQL☆477Updated this week
- A simplified, lightweight ETL Framework based on Apache Spark☆587Updated last year
- Trino Group Provider LDAP is a Trino (formerly Presto SQL) plugin to map user names to groups using an LDAP server☆23Updated last year
- Qubole Sparklens tool for performance tuning Apache Spark☆585Updated last year
- A load balancer / proxy / gateway for prestodb☆357Updated last year
- A Python Library to support running data quality rules while the spark job is running⚡☆191Updated this week
- REST API for Apache Spark on K8S or YARN☆106Updated last week
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,360Updated this week
- Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.☆1,025Updated this week
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,471Updated this week
- Official Dockerfile for Apache Spark☆151Updated 2 weeks ago
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆83Updated 7 months ago
- PyIceberg☆913Updated last week
- Sparglim✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier!☆38Updated this week
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆136Updated 2 years ago
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)☆251Updated 2 months ago
- A Python package to submit and manage Apache Spark applications on Kubernetes.☆44Updated 3 months ago
- The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog a…☆225Updated 7 months ago
- Helm charts for Trino and Trino Gateway☆184Updated last week
- Custom PySpark Data Sources☆79Updated last week