Spark package to "plug" holes in data using SQL based rules β‘οΈ π
β29May 15, 2020Updated 5 years ago
Alternatives and similar repositories for sparkplug
Users that are interested in sparkplug are comparing it to the libraries listed below
Sorting:
- Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.β115Mar 5, 2020Updated 5 years ago
- Scala Library for Reading Flat File Data (CSV/TSV/XLS/XLSX)β11Jul 13, 2023Updated 2 years ago
- Library and a Framework for building fast, scalable, fault-tolerant Data APIs based on Akka, Avro, ZooKeeper and Kafkaβ25Oct 16, 2020Updated 5 years ago
- Scala API for Apache Spark SQL high-order functionsβ14Aug 4, 2023Updated 2 years ago
- β14Jul 26, 2019Updated 6 years ago
- Scale GoCD Agents on demand with Dockerβ13Apr 15, 2018Updated 7 years ago
- Google Maps geocoding library for Scalaβ12Oct 12, 2019Updated 6 years ago
- A dynamic data completeness and accuracy library at enterprise scale for Apache Sparkβ29Nov 4, 2024Updated last year
- This toolkit provides an implementation of Modified Adsorption (MAD), a graph-based semi-supervised learning (SSL) algorithm.β24Jun 20, 2017Updated 8 years ago
- Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream β¦β22Feb 6, 2017Updated 9 years ago
- A collection of Apache Parquet add-on modulesβ30Feb 12, 2026Updated 2 weeks ago
- Immutable DataTable implementation in Scalaβ70Dec 30, 2019Updated 6 years ago
- Make your joins typesafe againβ26Feb 5, 2026Updated 3 weeks ago
- Purely functional genetic algorithms for multi-objective optimisationβ73Jan 22, 2026Updated last month
- An extremely barebones boilerplate project for compiling Kotlin to Javascriptβ10May 25, 2017Updated 8 years ago
- FSelector R packageβ12Aug 22, 2023Updated 2 years ago
- Typesafe, purely functional Computational Intelligenceβ124Aug 5, 2022Updated 3 years ago
- Large-scale event processing with Akka Persistence and Apache Sparkβ273Jun 18, 2016Updated 9 years ago
- N-dimensional / multi-dimensional arrays (tensors) in Scala 3. Think NumPy ndarray / PyTorch Tensor but type-safe over shapes, array/axisβ¦β47Dec 22, 2022Updated 3 years ago
- Scala Math - Numerical (Matlab-like) and Symbolic (Mathematica-like) toolβ71Nov 25, 2019Updated 6 years ago
- Open source task scheduler with dependency managementβ15Jul 1, 2018Updated 7 years ago
- Scala library for accessing various file, batch systems, job schedulers and grid middlewares.β28Nov 26, 2025Updated 3 months ago
- Data-ish exploration through SQL+Uncertaintyβ27Oct 31, 2022Updated 3 years ago
- Build universally reusable web fragments on the JVMβ26Dec 5, 2023Updated 2 years ago
- Scala, DSL, Rules based reactive workflows and Microservicesβ14Oct 20, 2025Updated 4 months ago
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multipleβ¦β26Jun 7, 2021Updated 4 years ago
- RETIRED. Provides extension functions and features for smooth development with Bootique and Kotlin.β12Apr 28, 2024Updated last year
- RocksDB Ops CLIβ12Dec 17, 2016Updated 9 years ago
- GoCD plugins to work with MLFlow as model repository in a CD flowβ31Nov 1, 2023Updated 2 years ago
- Efficient diffing in Scalaβ61Nov 4, 2025Updated 3 months ago
- A library that brings useful functions from various modern database management systems to Apache Sparkβ61Sep 4, 2023Updated 2 years ago
- GraalVM native-image as a docker containerβ13Oct 11, 2018Updated 7 years ago
- A Scala Collection for Multiple Access Patternsβ12Oct 22, 2016Updated 9 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLPβ15Oct 12, 2016Updated 9 years ago
- β13Nov 20, 2016Updated 9 years ago
- β11Nov 15, 2016Updated 9 years ago
- Scala wrappers for MapDBβ12Sep 9, 2017Updated 8 years ago
- A very simple, strongly typed, scala framework for tabular data. A collection of tuples. A strongly typed scala csv reader and writer. β¦β143Jul 19, 2019Updated 6 years ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easyβ63Dec 17, 2023Updated 2 years ago