Flowchart for debugging Spark applications
☆106Sep 25, 2024Updated last year
Alternatives and similar repositories for spark-flowchart
Users that are interested in spark-flowchart are comparing it to the libraries listed below
Sorting:
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads.☆431Jan 14, 2022Updated 4 years ago
- Examples for High Performance Spark☆16Oct 25, 2025Updated 4 months ago
- Type safety for spark columns☆79Oct 27, 2025Updated 4 months ago
- Paper: A Zero-rename committer for object stores☆20Nov 7, 2025Updated 3 months ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆61Sep 4, 2023Updated 2 years ago
- Code snippets used in demos recorded for the blog.☆38Feb 17, 2026Updated 2 weeks ago
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive☆187Oct 15, 2025Updated 4 months ago
- A flake8 plugin that detects of usage withColumn in a loop or inside reduce☆28Jun 20, 2025Updated 8 months ago
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆94May 9, 2025Updated 9 months ago
- Client libraries of end users of Apache Kyuubi☆11Jan 10, 2023Updated 3 years ago
- Activity Streams Parser for Python☆27Feb 29, 2012Updated 14 years ago
- Configuration for Nix on my macOS machines☆14Feb 26, 2026Updated last week
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆10Feb 2, 2024Updated 2 years ago
- Don't Panic. This guide will help you when it feels like the end of the world.☆30Feb 7, 2026Updated 3 weeks ago
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆816Updated this week
- Spark metrics related custom classes and sinks (e.g. Prometheus)☆188Aug 2, 2022Updated 3 years ago
- Cost Efficient Data Pipelines with DuckDB☆62May 14, 2025Updated 9 months ago
- Run spark calculations from Ammonite☆117Feb 20, 2026Updated last week
- A curated list of awesome SQLMesh resources☆38Apr 30, 2025Updated 10 months ago
- Sample app to use ZIO and DIstage with a playframework application☆11Jan 15, 2021Updated 5 years ago
- Magic to help Spark pipelines upgrade☆34Sep 29, 2024Updated last year
- On the fly, translation of Spark programs to run natively on your Oracle DB. Your Spark programs require no changes.☆35Apr 15, 2025Updated 10 months ago
- Embedded Kafka for testing and quick prototyping.☆14Apr 19, 2016Updated 9 years ago
- Spark NLP for Streamlit☆15Sep 12, 2021Updated 4 years ago
- The Internals of Delta Lake☆188Nov 30, 2025Updated 3 months ago
- DataOps Observability is part of DataKitchen's Open Source Data Observability. DataOps Observability monitors every data journey from da…☆50Nov 5, 2025Updated 4 months ago
- The official repository for the Rock the JVM Spark Optimization 2 course☆43Dec 4, 2023Updated 2 years ago
- HA, fault-tolerant, non-intrusive INotify for Hadoop HDFS☆18Apr 16, 2023Updated 2 years ago
- Library to run in process Kafka broker☆16Nov 20, 2018Updated 7 years ago
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.☆257Feb 21, 2023Updated 3 years ago
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆347May 31, 2024Updated last year
- Allowing engineers to work efficiently☆29Feb 8, 2026Updated 3 weeks ago
- Optimizing Databricks Workload, published by Packt☆18Jan 18, 2023Updated 3 years ago
- The Internals of Apache Spark☆1,541Jul 5, 2025Updated 8 months ago
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.☆3,588Feb 17, 2026Updated 2 weeks ago
- Gather system information about airflow processes☆18Mar 12, 2020Updated 5 years ago
- Realistic sample value generators for Scala.☆16Jul 4, 2024Updated last year
- Scala library for memory-efficient data structures☆74Oct 23, 2017Updated 8 years ago
- Open Source Secret Provider plugin for the Kafka Connect framework☆47Jul 19, 2024Updated last year