Drop-in replacement for Apache Spark UI
☆432 Mar 29, 2026 Updated last week
Alternatives and similar repositories for spark
Users who are interested in spark are comparing it to the libraries listed below.
- This repository contains the development code for sparkMeasure, an Apache Spark performance analysis and troubleshooting library. It simp… ☆818 Apr 1, 2026 Updated last week
- Apache Spark Kubernetes Operator ☆272 Mar 31, 2026 Updated last week
- Spark-Dashboard is an open-source monitoring solution for Apache Spark that provides real-time performance dashboards using containers an… ☆134 Apr 1, 2026 Updated last week
- A library that provides useful extensions to Apache Spark and PySpark. ☆235 Mar 18, 2026 Updated 3 weeks ago
- Traditionally, engineers were needed to implement business logic via data pipelines before business users can start using it. Using this … ☆12 Mar 25, 2026 Updated 2 weeks ago
- ☆16 Jul 25, 2025 Updated 8 months ago
- Delta Lake helper methods. No Spark dependency. ☆22 Jan 19, 2026 Updated 2 months ago
- Multi-hop declarative data pipelines ☆127 Updated this week
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines. ☆1,544 Updated this week
- The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query process… ☆1,743 Updated this week
- PySpark test helper methods with beautiful error messages ☆759 Updated this week
- Apache Celeborn is an elastic and high-performance service for shuffle and spilled data. ☆1,043 Apr 2, 2026 Updated last week
- Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processin… ☆1,173 Apr 1, 2026 Updated last week
- ☆19 Jun 15, 2020 Updated 5 years ago
- Open Control Plane for Tables in Data Lakehouse ☆384 Updated this week
- Uniffle is a high performance, general purpose Remote Shuffle Service. ☆447 Apr 2, 2026 Updated last week
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow running custom code on the executors as they are in… ☆95 May 9, 2025 Updated 11 months ago
- Basic Spark utilities ☆13 Feb 20, 2025 Updated last year
- LakeSail's computation framework with a mission to unify batch processing, stream processing, and compute-intensive AI workloads. ☆1,282 Updated this week
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them ☆136 Oct 25, 2023 Updated 2 years ago
- Delta lake and filesystem helper methods ☆50 Feb 29, 2024 Updated 2 years ago
- A tool to benchmark L (loading) workloads within ETL workloads ☆32 Updated this week
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source. ☆346 May 31, 2024 Updated last year
- Spark metrics related custom classes and sinks (e.g. Prometheus) ☆188 Aug 2, 2022 Updated 3 years ago
- Python API for Deequ ☆814 Mar 9, 2026 Updated last month
- Open, Multi-modal Catalog for Data & AI ☆3,348 Updated this week
- Qubole Sparklens tool for performance tuning Apache Spark ☆589 Jun 26, 2024 Updated last year
- Apache Polaris, the interoperable, open source catalog for Apache Iceberg ☆1,890 Updated this week
- Terraform module to create AWS EMR resources 🇺🇦 ☆26 Mar 11, 2026 Updated 3 weeks ago
- ☆29 Dec 5, 2025 Updated 4 months ago
- Rust based high-performance Apache Uniffle shuffle-server ☆64 Apr 1, 2026 Updated last week
- Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust. ☆1,240 Apr 3, 2026 Updated last week
- WIP - Scaling Spark Data Platform with EKS. The solution uses Karpenter and Cluster Autoscaler, Yunikorn for advanced scheduling. ☆16 May 9, 2023 Updated 2 years ago
- Dashboard for operating Flink jobs and deployments. ☆44 Jan 31, 2026 Updated 2 months ago
- Monitoring and insights on your data lakehouse tables ☆32 Updated this week
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages. ☆893 Mar 25, 2026 Updated 2 weeks ago
- Arrow-Powered Data Exchange ☆15 Feb 7, 2025 Updated last year
- Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes. ☆3,110 Mar 31, 2026 Updated last week
- Visualize column-level data lineage in Spark SQL ☆92 May 13, 2022 Updated 3 years ago