springnz/sparkplug

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/springnz/sparkplug)

springnz / sparkplug

A framework for creating composable and pluggable data processing pipelines using Apache Spark, and running them on a cluster.

☆47

Alternatives and similar repositories for sparkplug

Users that are interested in sparkplug are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

KadekM / scrawler
View on GitHub
Scala web crawling and scraping using fs2 streams
☆16Aug 29, 2017Updated 8 years ago
apache / incubator-retired-mrql
View on GitHub
Mirror of Apache MRQL (Incubating)
☆17Aug 22, 2017Updated 8 years ago
cosminseceleanu / scala-pipeline
View on GitHub
Pipeline Pattern implementation in Scala
☆12Mar 18, 2018Updated 8 years ago
BenFradet / struct-type-encoder
View on GitHub
Deriving Spark DataFrame schemas from case classes
☆44Jun 24, 2024Updated 2 years ago
softwaremill / blog-scala-structure-lifecycle
View on GitHub
☆12Oct 7, 2019Updated 6 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
TresAmigosSD / SMV
View on GitHub
Spark Modularized View
☆43May 25, 2020Updated 6 years ago
hammerlab / sbt-parent
View on GitHub
SBT plugins for publishing to Maven Central, shading and managing dependencies, reporting to Coveralls from TravisCI, and more
☆14Nov 13, 2020Updated 5 years ago
catie-aq / triton-rust
View on GitHub
An api for interfacing Nvidia Trition Inference Server with Rust
☆12Jun 12, 2023Updated 3 years ago
tresata / spark-skewjoin
View on GitHub
Joins for skewed datasets in Spark
☆58Aug 18, 2017Updated 8 years ago
McKalvan / SCRAPI
View on GitHub
☆12Feb 28, 2021Updated 5 years ago
levkhomich / activator-akka-tracing
View on GitHub
Activator template for akka-tracing project
☆23Jul 18, 2016Updated 10 years ago
bloomberg / spark-flow
View on GitHub
Library for organizing batch processing pipelines in Apache Spark
☆43Jan 4, 2017Updated 9 years ago
allenai / pipeline
View on GitHub
Library for building reproducible data pipelines to support experimentation
☆20Dec 16, 2015Updated 10 years ago
cloudml / zen
View on GitHub
Zen aims to provide the largest scale and the most efficient machine learning platform on top of Spark, including but not limited to logi…
☆169Nov 17, 2018Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
takezoe / tranquil
View on GitHub
Type-safe SQL builder for Scala
☆31Jun 4, 2026Updated last month
typelevel / frameless
View on GitHub
Expressive types for Spark.
☆898Updated this week
sjtu-iiot / graphx-algorithm
View on GitHub
Graph algorithms implemented in GraphX and Spark styles
☆15Apr 26, 2015Updated 11 years ago
MrPowers / spark-spec
View on GitHub
Test suite to document the behavior of Spark
☆21Apr 15, 2021Updated 5 years ago
intentmedia / mario
View on GitHub
Functional, Typesafe, Declarative Data Pipelines
☆140Jan 29, 2018Updated 8 years ago
adamw / reactive-akka-pres
View on GitHub
"Reactive Akka" presentation
☆30Aug 19, 2015Updated 10 years ago
FRosner / spawncamping-dds
View on GitHub
Data-Driven Spark allows quick data exploration based on Apache Spark.
☆29Jan 6, 2017Updated 9 years ago
koeninger / spark-cassandra-example
View on GitHub
Example usage of spark cassandra connector
☆25Nov 21, 2014Updated 11 years ago
univalence / spark-tools
View on GitHub
☆46Apr 27, 2020Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
sequenceiq / docker-spark-native-yarn
View on GitHub
☆13Mar 8, 2018Updated 8 years ago
dhutchis / LaraDB
View on GitHub
A platform for unified linear and relational algebra analytics, built on the Accumulo NoSQL database
☆13Feb 9, 2022Updated 4 years ago
swoop-inc / spark-records
View on GitHub
Bulletproof Apache Spark jobs with fast root cause analysis of failures.
☆73Mar 14, 2021Updated 5 years ago
Netflix / derand
View on GitHub
☆32Apr 10, 2023Updated 3 years ago
oscarrenalias / scalacheck-cookbook
View on GitHub
☆13Jul 29, 2024Updated last year
memsql / streamliner-starter
View on GitHub
Starter project for building MemSQL Streamliner Pipelines
☆32Apr 18, 2017Updated 9 years ago
UBOdin / jitd
View on GitHub
Just in Time Datastructures
☆11Feb 21, 2017Updated 9 years ago
dfinance / dvm
View on GitHub
dfinance Virtual Machine for Move language
☆25Jul 2, 2021Updated 5 years ago
swoop-inc / spark-alchemy
View on GitHub
Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
☆191Oct 15, 2025Updated 9 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
dylandoamaral / zio-selenium
View on GitHub
A ZIO wrapper to interact with a browser using Selenium.
☆17Jan 16, 2025Updated last year
looker-open-source / app-ml-accelerator
View on GitHub
Looker extension designed to give business users access to BigQuery and Vertex AI's machine learning capabilities.
☆17Jun 4, 2026Updated last month
ottogroup / schedoscope
View on GitHub
Schedoscope is a scheduling framework for painfree agile development, testing, (re)loading, and monitoring of your datahub, lake, or what…
☆98Nov 14, 2019Updated 6 years ago
llooker / demo_segment
View on GitHub
☆16Oct 6, 2020Updated 5 years ago
FRosner / drunken-data-quality
View on GitHub
Spark package for checking data quality
☆220Feb 28, 2020Updated 6 years ago
FINRAOS / herd-ui
View on GitHub
Herd-UI is a search and discovery tool for business and technical users. Everyone in your organization can use Herd-UI to browse and unde…
☆16Oct 1, 2022Updated 3 years ago
collectivemedia / modelmatrix
View on GitHub
Sparse feature extraction with Spark
☆30Jul 25, 2018Updated 7 years ago