REST API for Apache Spark on K8S or YARN
☆110Dec 5, 2025Updated 3 months ago
Alternatives and similar repositories for lighter
Users that are interested in lighter are comparing it to the libraries listed below
Sorting:
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A…☆134Jan 5, 2026Updated 2 months ago
- ☆17Jan 23, 2026Updated last month
- Jupyter magics and kernels for working with remote Spark clusters☆1,362Sep 9, 2025Updated 5 months ago
- ☆10Jun 3, 2023Updated 2 years ago
- streaming data pipeline platform☆29Jan 4, 2026Updated 2 months ago
- Spark on Kubernetes infrastructure Helm charts repo☆202Oct 20, 2022Updated 3 years ago
- Type class based library to read / write XML☆16Jul 1, 2020Updated 5 years ago
- Mirror of Apache livy (Incubating)☆13Feb 8, 2024Updated 2 years ago
- Exporter based on Hadoop clusters that use Ambari as their administrative tool, leveraging Ambari API to export cluster's metrics.☆21Jun 12, 2023Updated 2 years ago
- Open, Multi-modal Catalog for Data & AI, written in Rust☆86Sep 30, 2024Updated last year
- Theano bindings for Baidu's CTC library.☆20Aug 25, 2016Updated 9 years ago
- A re-implementation of Hadoop DistCP in Apache Spark☆47Dec 20, 2023Updated 2 years ago
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.☆946Feb 26, 2026Updated last week
- Spark metrics related custom classes and sinks (e.g. Prometheus)☆188Aug 2, 2022Updated 3 years ago
- Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.☆3,106Updated this week
- A library that provides useful extensions to Apache Spark and PySpark.☆232Jan 20, 2026Updated last month
- Spark integrations for working with Lance datasets☆45Updated this week
- A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver.☆25Sep 8, 2023Updated 2 years ago
- ☆376Updated this week
- Mirror of Apache Toree (Incubating)☆749Feb 21, 2026Updated last week
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.☆431Jan 14, 2022Updated 4 years ago
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆347May 31, 2024Updated last year
- Ansible playbooks for Apache Spark on kube☆27Jul 20, 2017Updated 8 years ago
- Open Control Plane for Tables in Data Lakehouse☆380Updated this week
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆122Updated this week
- Use dbt to manage real-time data transformations in RisingWave.☆35Feb 3, 2026Updated last month
- FederatedCatalog☆11Feb 24, 2026Updated last week
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆816Updated this week
- A highly efficient daemon for streaming data from Kafka into Delta Lake☆428May 5, 2025Updated 10 months ago
- Drop-in replacement for Apache Spark UI☆413Feb 17, 2026Updated 2 weeks ago
- pyspark methods to enhance developer productivity 📣 👯 🎉☆683Mar 6, 2025Updated 11 months ago
- My Study guide used to pass the CRT020 Spark Certification exam☆34Jan 6, 2020Updated 6 years ago
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆83Apr 12, 2025Updated 10 months ago
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,425Updated this week
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflows☆12Feb 26, 2026Updated last week
- This project implements a Lakehouse Medallion Architecture using modern Data Stack tools such as Fivetran, Snowflake and dbt. The fictici…☆14Sep 30, 2024Updated last year
- This solution provides the AWS CDK and AWS CloudFormation infrastructure to build an enterprise data mesh with Amazon DataZone.☆10May 7, 2025Updated 9 months ago
- An in-process Parquet merge engine for better data warehousing in S3 with MVCC☆151Feb 15, 2026Updated 2 weeks ago
- ☆81Apr 23, 2025Updated 10 months ago