Spark Structured Streaming State Tools
☆34Jul 3, 2020Updated 5 years ago
Alternatives and similar repositories for spark-state-tools
Users that are interested in spark-state-tools are comparing it to the libraries listed below
Sorting:
- Custom state store providers for Apache Spark☆92Feb 14, 2025Updated last year
- A sink to save Spark Structured Streaming DataFrame into Hive table☆23May 7, 2018Updated 7 years ago
- Make Structs Easy (MSE)☆18Jun 22, 2020Updated 5 years ago
- Reporting Apache Spark metrics to Elasticsearch☆13Aug 11, 2016Updated 9 years ago
- Task Metrics Explorer☆14Apr 2, 2019Updated 6 years ago
- Spark Structured Streaming Kafka 0.8 Source Implementation☆35Apr 27, 2017Updated 8 years ago
- Rocksdb state storage implementation for Structured Streaming.☆17Oct 21, 2020Updated 5 years ago
- Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines☆17Jan 21, 2020Updated 6 years ago
- Kafka offset committer for structured streaming query☆40Feb 15, 2021Updated 5 years ago
- Query Plan Markup Language☆45Jan 18, 2024Updated 2 years ago
- Paper: A Zero-rename committer for object stores☆20Nov 7, 2025Updated 4 months ago
- Deriving Spark DataFrame schemas from case classes☆44Jun 24, 2024Updated last year
- Spark history server Helm Chart☆22Mar 19, 2024Updated last year
- Spark cloud integration: tests, cloud committers and more☆20Jan 30, 2025Updated last year
- A flake8 plugin that detects of usage withColumn in a loop or inside reduce☆28Jun 20, 2025Updated 8 months ago
- Spark on Kubernetes infrastructure Helm charts repo☆202Oct 20, 2022Updated 3 years ago
- An example of building kubernetes operator (Flink) using Abstract operator's framework☆26Jul 12, 2019Updated 6 years ago
- ACID Data Source for Apache Spark based on Hive ACID☆96Jul 7, 2021Updated 4 years ago
- Argument parsing in Scala☆84Mar 27, 2023Updated 2 years ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆62Sep 6, 2024Updated last year
- The Internals of Spark Structured Streaming☆422Updated this week
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆347May 31, 2024Updated last year
- Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.☆70May 8, 2023Updated 2 years ago
- an experimental Scala extension of Jar Jar Links☆38Feb 23, 2026Updated last week
- Apache Spark Website☆134Feb 27, 2026Updated last week
- ☆10Aug 23, 2023Updated 2 years ago
- ☆10Jul 1, 2022Updated 3 years ago
- I'll munch some data here☆12Jun 18, 2021Updated 4 years ago
- delay message system, when message reaches its ready time, will delivery to kafka☆11Jul 30, 2021Updated 4 years ago
- Parent repository for the MOJ Analytics Platform☆14Nov 16, 2021Updated 4 years ago
- Flink Examples☆38Apr 27, 2016Updated 9 years ago
- Spark on Kubernetes infrastructure Docker images repo☆37Oct 20, 2022Updated 3 years ago
- kudu学习的一些资料,以及和spark/impala的集成使用☆33Sep 11, 2017Updated 8 years ago
- BlockChain DApp using Angular☆10Sep 24, 2018Updated 7 years ago
- ☆10Jul 5, 2016Updated 9 years ago
- ☆10Mar 31, 2021Updated 4 years ago
- Generating short scary stories through language model.☆14Apr 28, 2025Updated 10 months ago
- Reproducible Analytical Pipeline of the Hospital Standardised Mortality Ratio (HSMR) quarterly publication☆11Jun 21, 2024Updated last year
- Local Development of AWS Glue with Docker and Visual Studio Code☆14Nov 29, 2021Updated 4 years ago