Spark package to "plug" holes in data using SQL based rules
☆29 · May 15, 2020 · Updated 5 years ago
Alternatives and similar repositories for sparkplug
Users that are interested in sparkplug are comparing it to the libraries listed below.
- Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API. ☆116 · Mar 5, 2020 · Updated 6 years ago
- Library and a Framework for building fast, scalable, fault-tolerant Data APIs based on Akka, Avro, ZooKeeper and Kafka ☆25 · Oct 16, 2020 · Updated 5 years ago
- ☆14 · Jul 26, 2019 · Updated 6 years ago
- Scala Library for Reading Flat File Data (CSV/TSV/XLS/XLSX) ☆11 · Jul 13, 2023 · Updated 2 years ago
- This toolkit provides an implementation of Modified Adsorption (MAD), a graph-based semi-supervised learning (SSL) algorithm. ☆24 · Jun 20, 2017 · Updated 8 years ago
- Scala API for Apache Spark SQL high-order functions ☆14 · Aug 4, 2023 · Updated 2 years ago
- Google Maps geocoding library for Scala ☆12 · Oct 12, 2019 · Updated 6 years ago
- A dynamic data completeness and accuracy library at enterprise scale for Apache Spark ☆30 · Apr 15, 2026 · Updated 2 weeks ago
- Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream … ☆22 · Feb 6, 2017 · Updated 9 years ago
- Immutable DataTable implementation in Scala ☆70 · Dec 30, 2019 · Updated 6 years ago
- Large-scale event processing with Akka Persistence and Apache Spark ☆273 · Jun 18, 2016 · Updated 9 years ago
- Open source task scheduler with dependency management ☆15 · Jul 1, 2018 · Updated 7 years ago
- Redis search and indexing in Java ☆16 · Sep 26, 2016 · Updated 9 years ago
- A collection of Apache Parquet add-on modules ☆30 · Apr 15, 2026 · Updated 2 weeks ago
- ☆11 · Nov 15, 2016 · Updated 9 years ago
- Scala, DSL, Rules based reactive workflows and Microservices ☆14 · Oct 20, 2025 · Updated 6 months ago
- Scala Math - Numerical (Matlab-like) and Symbolic (Mathematica-like) tool ☆72 · Nov 25, 2019 · Updated 6 years ago
- Build universally reusable web fragments on the JVM ☆27 · Dec 5, 2023 · Updated 2 years ago
- ☆13 · Nov 20, 2016 · Updated 9 years ago
- Typesafe, purely functional Computational Intelligence ☆125 · Updated this week
- Scala library for accessing various file, batch systems, job schedulers and grid middlewares. ☆29 · Apr 15, 2026 · Updated 2 weeks ago
- Library for building reproducible data pipelines to support experimentation ☆20 · Dec 16, 2015 · Updated 10 years ago
- Purely functional genetic algorithms for multi-objective optimisation ☆75 · Apr 16, 2026 · Updated 2 weeks ago
- N-dimensional / multi-dimensional arrays (tensors) in Scala 3. Think NumPy ndarray / PyTorch Tensor but type-safe over shapes, array/axis… ☆48 · Dec 22, 2022 · Updated 3 years ago
- An extremely barebones boilerplate project for compiling Kotlin to Javascript ☆10 · May 25, 2017 · Updated 8 years ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy ☆64 · Dec 17, 2023 · Updated 2 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark ☆62 · Sep 4, 2023 · Updated 2 years ago
- Building blocks and patterns for building data prep transformations and feature engineering in Spark. ☆16 · Mar 16, 2016 · Updated 10 years ago
- Efficient diffing in Scala ☆61 · Nov 4, 2025 · Updated 5 months ago
- A collection of tools for Golang ☆16 · Mar 27, 2019 · Updated 7 years ago
- Plot live-stats as graph from ApacheSpark application using Lightning-viz ☆18 · Jul 3, 2017 · Updated 8 years ago
- Make your joins typesafe again ☆26 · Feb 5, 2026 · Updated 2 months ago
- An sbt plugin to fix java.lang.OutOfMemoryError: Metaspace/PermGen errors during interactive sbt usage ☆14 · Feb 16, 2017 · Updated 9 years ago
- A Scala Collection for Multiple Access Patterns ☆12 · Oct 22, 2016 · Updated 9 years ago
- Data-ish exploration through SQL+Uncertainty ☆27 · Oct 31, 2022 · Updated 3 years ago
- Generate mock data based on an Apache Avro schema and specific cardinality settings ☆10 · Apr 16, 2018 · Updated 8 years ago
- Apache Parquet reader in Scala without Apache Spark - developed at Purdue University ☆12 · Feb 17, 2017 · Updated 9 years ago
- A very simple, strongly typed, scala framework for tabular data. A collection of tuples. A strongly typed scala csv reader and writer. … ☆145 · Jul 19, 2019 · Updated 6 years ago
- ☆21 · Mar 17, 2023 · Updated 3 years ago