HeartSaVioR / spark-state-toolsView external linksLinks
Spark Structured Streaming State Tools
☆34Jul 3, 2020Updated 5 years ago
Alternatives and similar repositories for spark-state-tools
Users that are interested in spark-state-tools are comparing it to the libraries listed below
Sorting:
- Custom state store providers for Apache Spark☆92Feb 14, 2025Updated last year
- Make Structs Easy (MSE)☆18Jun 22, 2020Updated 5 years ago
- Task Metrics Explorer☆14Apr 2, 2019Updated 6 years ago
- Reporting Apache Spark metrics to Elasticsearch☆13Aug 11, 2016Updated 9 years ago
- ☆16Oct 17, 2024Updated last year
- Spark Structured Streaming Kafka 0.8 Source Implementation☆35Apr 27, 2017Updated 8 years ago
- Rocksdb state storage implementation for Structured Streaming.☆17Oct 21, 2020Updated 5 years ago
- Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines☆17Jan 21, 2020Updated 6 years ago
- Query Plan Markup Language☆45Jan 18, 2024Updated 2 years ago
- Paper: A Zero-rename committer for object stores☆20Nov 7, 2025Updated 3 months ago
- Deriving Spark DataFrame schemas from case classes☆44Jun 24, 2024Updated last year
- Spark history server Helm Chart☆22Mar 19, 2024Updated last year
- Spark cloud integration: tests, cloud committers and more☆20Jan 30, 2025Updated last year
- A flake8 plugin that detects of usage withColumn in a loop or inside reduce☆28Jun 20, 2025Updated 7 months ago
- Spark on Kubernetes infrastructure Helm charts repo☆203Oct 20, 2022Updated 3 years ago
- An example of building kubernetes operator (Flink) using Abstract operator's framework☆26Jul 12, 2019Updated 6 years ago
- ACID Data Source for Apache Spark based on Hive ACID☆96Jul 7, 2021Updated 4 years ago
- Argument parsing in Scala☆84Mar 27, 2023Updated 2 years ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆62Sep 6, 2024Updated last year
- Remedy small files by combining them into larger ones.☆23Oct 31, 2018Updated 7 years ago
- The Internals of Spark Structured Streaming☆422Jan 25, 2026Updated 2 weeks ago
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆347May 31, 2024Updated last year
- Terraform modules for provisioning and managing AWS Glue resources☆34Dec 10, 2025Updated 2 months ago
- Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.☆70May 8, 2023Updated 2 years ago
- an experimental Scala extension of Jar Jar Links☆38Jan 20, 2026Updated 3 weeks ago
- Apache Spark Website☆133Updated this week
- I'll munch some data here☆12Jun 18, 2021Updated 4 years ago
- The ONS Big Data Team Github pages☆10May 19, 2021Updated 4 years ago
- delay message system, when message reaches its ready time, will delivery to kafka☆11Jul 30, 2021Updated 4 years ago
- ☆10Jul 1, 2022Updated 3 years ago
- Flink Examples☆38Apr 27, 2016Updated 9 years ago
- Spark on Kubernetes infrastructure Docker images repo☆38Oct 20, 2022Updated 3 years ago
- Reproducible Research in Finse☆10Aug 5, 2020Updated 5 years ago
- Local Development of AWS Glue with Docker and Visual Studio Code☆14Nov 29, 2021Updated 4 years ago
- ☆10Jul 5, 2016Updated 9 years ago
- ☆38Feb 28, 2018Updated 7 years ago
- ☆11Jan 28, 2019Updated 7 years ago
- A Scala library for locality sensitive hashing☆14Aug 1, 2018Updated 7 years ago
- Generating short scary stories through language model.☆13Apr 28, 2025Updated 9 months ago