Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are initialized. This also allows extending the Spark metrics systems with user-provided monitoring probes.
☆96May 11, 2026Updated last month
Alternatives and similar repositories for SparkPlugins
Users that are interested in SparkPlugins are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Spark-Dashboard is an open-source monitoring solution for Apache Spark that provides real-time performance dashboards using containers an…☆135May 6, 2026Updated last month
- ☆10Jun 29, 2021Updated 4 years ago
- This repository contains the development code for sparkMeasure, an Apache Spark performance analysis and troubleshooting library. It simp…☆825May 19, 2026Updated 3 weeks ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆63Sep 4, 2023Updated 2 years ago
- A library that provides useful extensions to Apache Spark and PySpark.☆238Jun 5, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Spark-Radiant is Apache Spark Performance and Cost Optimizer☆25Dec 31, 2024Updated last year
- This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark w…☆16May 21, 2026Updated 3 weeks ago
- Paper: A Zero-rename committer for object stores☆20Nov 7, 2025Updated 7 months ago
- Qubole Sparklens tool for performance tuning Apache Spark☆591Jun 26, 2024Updated last year
- On the fly, translation of Spark programs to run natively on your Oracle DB. Your Spark programs require no changes.☆35Apr 15, 2025Updated last year
- Spark SQL index for Parquet tables☆134May 6, 2021Updated 5 years ago
- Spark RAPIDS plugin - accelerate Apache Spark with GPUs☆978Updated this week
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,569Updated this week
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.☆430Jan 14, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The Internals of Delta Lake☆186May 10, 2026Updated last month
- REST job server for Apache Spark☆44May 23, 2025Updated last year
- Apache DataFusion Comet Spark Accelerator☆1,209Updated this week
- A tool to validate data, built around Apache Spark.☆102Jun 8, 2026Updated last week
- Essential Spark extensions and helper methods ✨😲☆767Sep 14, 2025Updated 9 months ago
- Typesafe wrapper for Apache Spark DataFrame API☆144Jan 24, 2026Updated 4 months ago
- This repo contains examples of high throughput ingestion using Apache Spark and Apache Iceberg. These examples cover IoT and CDC scenario…☆28Mar 17, 2026Updated 2 months ago
- DataTunnel 是一个基于spark引擎的超高性能的分布式数据集成软件,支持海量数据的同步。基于spark extensions 扩展的DSL语法,结合的Spark SQL,更加便捷融入数仓 ETLT 过程中,简单易用。☆36Jun 4, 2026Updated last week
- Expressive types for Spark.☆898Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A Spark plugin for CPU and memory profiling☆21Mar 17, 2026Updated 2 months ago
- A Python Library to support running data quality rules while the spark job is running⚡☆202May 19, 2026Updated 3 weeks ago
- Sample processing code using Spark 2.1+ and Scala☆51Jun 28, 2020Updated 5 years ago
- ACID Data Source for Apache Spark based on Hive ACID☆97Jul 7, 2021Updated 4 years ago
- Using log4j insert log info into ElasticSearch☆26Oct 31, 2016Updated 9 years ago
- type-class based data cleansing library for Apache Spark SQL☆79Jun 23, 2019Updated 6 years ago
- Mirror of Apache DataFu☆124May 18, 2026Updated 3 weeks ago
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)☆456Apr 2, 2026Updated 2 months ago
- The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query process…☆1,771Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆64Nov 8, 2019Updated 6 years ago
- 手把手教你使用JavaNIO构建Actor模式的高性能分布式系统☆16Apr 16, 2018Updated 8 years ago
- Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations shou…☆10Jul 31, 2023Updated 2 years ago
- The Internals of Spark SQL☆487Jan 25, 2026Updated 4 months ago
- Apache Spark Kubernetes Operator☆289Updated this week
- spark structured streaming via HTTP communication☆18Jul 7, 2022Updated 3 years ago
- A dynamic data completeness and accuracy library at enterprise scale for Apache Spark☆30May 13, 2026Updated last month