HeartSaVioR/spark-state-tools

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/HeartSaVioR/spark-state-tools)

HeartSaVioR / spark-state-tools

Spark Structured Streaming State Tools

☆34

Alternatives and similar repositories for spark-state-tools

Users that are interested in spark-state-tools are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

chermenin / spark-states
View on GitHub
Custom state store providers for Apache Spark
☆92Feb 14, 2025Updated last year
fqaiser94 / mse
View on GitHub
Make Structs Easy (MSE)
☆18Jun 22, 2020Updated 6 years ago
qubole / spark-state-store
View on GitHub
Rocksdb state storage implementation for Structured Streaming.
☆17Oct 21, 2020Updated 5 years ago
attilapiros / trace-agent
View on GitHub
A java agent for tracing which can be configured via simple text file and instruments the code without rebuilding the project.
☆51Jul 12, 2026Updated 2 weeks ago
hortonworks-spark / spark-hive-streaming-sink
View on GitHub
A sink to save Spark Structured Streaming DataFrame into Hive table
☆23May 7, 2018Updated 8 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
apache / incubator-retired-horn
View on GitHub
Mirror of Apache Horn (Incubating) ** This project has been retired **
☆28Apr 28, 2017Updated 9 years ago
HeartSaVioR / spark-sql-kafka-offset-committer
View on GitHub
Kafka offset committer for structured streaming query
☆41Feb 15, 2021Updated 5 years ago
redsk / neo_concept
View on GitHub
ConceptNet to neo4j 2.2
☆10Nov 6, 2015Updated 10 years ago
steveloughran / zero-rename-committer
View on GitHub
Paper: A Zero-rename committer for object stores
☆20Nov 7, 2025Updated 8 months ago
qubole / streaminglens
View on GitHub
Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines
☆17Jan 21, 2020Updated 6 years ago
zheyuan28 / SparkTaskMetrics
View on GitHub
Task Metrics Explorer
☆14Apr 2, 2019Updated 7 years ago
BenFradet / struct-type-encoder
View on GitHub
Deriving Spark DataFrame schemas from case classes
☆44Jun 24, 2024Updated 2 years ago
jerryshao / spark-kafka-0-8-sql
View on GitHub
Spark Structured Streaming Kafka 0.8 Source Implementation
☆35Apr 27, 2017Updated 9 years ago
apache / spark-website
View on GitHub
Apache Spark Website
☆140Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
datamechanics / delight
View on GitHub
A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.
☆345May 31, 2024Updated 2 years ago
rangareddy / ranga-java-oom
View on GitHub
Java OutOfMemory Example
☆11Jun 19, 2021Updated 5 years ago
JahstreetOrg / spark-on-kubernetes-helm
View on GitHub
Spark on Kubernetes infrastructure Helm charts repo
☆202Oct 20, 2022Updated 3 years ago
ibmruntimes / spelk
View on GitHub
Reporting Apache Spark metrics to Elasticsearch
☆13Aug 11, 2016Updated 9 years ago
sunng87 / clojuredocs-android
View on GitHub
An Android app for ClojureDocs
☆14Jan 27, 2012Updated 14 years ago
ColCarroll / working_ml
View on GitHub
Examples of applied machine learning
☆13Dec 27, 2017Updated 8 years ago
hortonworks-spark / cloud-integration
View on GitHub
Spark cloud integration: tests, cloud committers and more
☆20Jan 30, 2025Updated last year
vpon / protobuf-to-avro
View on GitHub
dynamically parse protobuf message then convert to avro
☆25May 27, 2015Updated 11 years ago
jparkie / Spark2Cassandra
View on GitHub
Spark Library for Bulk Loading into Cassandra
☆12Apr 18, 2018Updated 8 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
japila-books / spark-structured-streaming-internals
View on GitHub
The Internals of Spark Structured Streaming
☆420Mar 3, 2026Updated 4 months ago
trek10inc / lambda-local-cache
View on GitHub
☆10Jul 5, 2016Updated 10 years ago
RedisLabs / spark-redis-ml
View on GitHub
A spark package for loading Spark ML models to Redis-ML
☆62Jun 22, 2019Updated 7 years ago
smart-inner / smarttune
View on GitHub
SmartTune is a black-box optimization that can automatically find good performance settings for a complex system's configuration knobs.
☆11Nov 23, 2022Updated 3 years ago
jamartinh / Orange3-Spark
View on GitHub
A set of widgets for Python's Orange Machine Learning to work with Apache Spark ML
☆15Dec 24, 2016Updated 9 years ago
jahstreet / incubator-livy
View on GitHub
Mirror of Apache livy (Incubating)
☆13Feb 8, 2024Updated 2 years ago
eastcirclek / swimlane-graphs
View on GitHub
Swimlane graphs for Hive, SparkSQL, and Presto based on Ganglia resource graphs
☆13Feb 13, 2017Updated 9 years ago
qubole / spark-acid
View on GitHub
ACID Data Source for Apache Spark based on Hive ACID
☆97Jul 7, 2021Updated 5 years ago
AbsaOSS / hyperdrive
View on GitHub
Extensible streaming ingestion pipeline on top of Apache Spark
☆47Jul 17, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
maropu / spark-sql-flow-plugin
View on GitHub
Visualize column-level data lineage in Spark SQL
☆92May 13, 2022Updated 4 years ago
lightbend / flink-k8s-operator
View on GitHub
An example of building kubernetes operator (Flink) using Abstract operator's framework
☆26Jul 12, 2019Updated 7 years ago
colbyford / sparkitecture
View on GitHub
A collection of “cookbook-style” scripts for simplifying data engineering and machine learning in Apache Spark.
☆13Oct 27, 2021Updated 4 years ago
SemyonSinchenko / flake8-pyspark-with-column
View on GitHub
A flake8 plugin that detects of usage withColumn in a loop or inside reduce
☆28Jun 20, 2025Updated last year
phatak-dev / flink-examples
View on GitHub
Flink Examples
☆39Apr 27, 2016Updated 10 years ago
carlosescura / spark-history-server-helm
View on GitHub
Spark history server Helm Chart
☆22Mar 19, 2024Updated 2 years ago
japila-books / delta-lake-internals
View on GitHub
The Internals of Delta Lake
☆186Jun 18, 2026Updated last month