A tool for data sampling, data generation, and data diffing
☆346Jan 8, 2026Updated 2 months ago
Alternatives and similar repositories for ratatool
Users that are interested in ratatool are comparing it to the libraries listed below
Sorting:
- A Scala feature transformation library for data science and machine learning☆473Feb 7, 2025Updated last year
- Scala Aggregators used for ML Model metrics monitoring☆91Sep 13, 2023Updated 2 years ago
- A Scala API for Apache Beam and Google Cloud Dataflow.☆2,620Feb 27, 2026Updated 3 weeks ago
- ☆23Jan 3, 2025Updated last year
- Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code☆297Jan 31, 2025Updated last year
- Shapeless utilities for common data types☆67Mar 14, 2026Updated last week
- Scio IDEA plugin☆30Oct 2, 2025Updated 5 months ago
- GCS support for avro-tools, parquet-tools and protobuf☆79May 5, 2025Updated 10 months ago
- Runs JVM closures in Docker containers on Kubernetes☆36Mar 23, 2018Updated 7 years ago
- A collection of Magnolia add-on modules☆182Feb 12, 2026Updated last month
- "The path to execution", Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes.☆270Jul 12, 2023Updated 2 years ago
- Abstract Algebra for Scala☆2,301Nov 21, 2025Updated 4 months ago
- http://www.scala-sbt.org/contraband/☆71Jan 12, 2026Updated 2 months ago
- A lightweight workflow definition library☆155Jul 15, 2022Updated 3 years ago
- Ephemeral Hadoop clusters using Google Compute Platform☆135Mar 31, 2022Updated 3 years ago
- Common library for serving TensorFlow, XGBoost and scikit-learn models in production.☆142Sep 11, 2023Updated 2 years ago
- An implementation of Huet’s Zipper for Scala and Scala.js that is intended to be usable in many common scenarios☆49Aug 18, 2024Updated last year
- Expressive types for Spark.☆896Updated this week
- Generic protobuf manipulation☆36Mar 3, 2026Updated 2 weeks ago
- Multisets for Scala☆86Jul 23, 2021Updated 4 years ago
- A collection of Apache Parquet add-on modules☆30Mar 3, 2026Updated 2 weeks ago
- A combinator library for integrating Functional Streams for Scala (FS2), Akka Streams and Apache Camel☆280Sep 3, 2024Updated last year
- Interop between fs2 and scalaz☆14Feb 9, 2018Updated 8 years ago
- Release with confidence, state-of-the-art property testing for Scala.☆267Dec 15, 2025Updated 3 months ago
- Binding between scodec and FS2☆55Oct 23, 2021Updated 4 years ago
- A Giter8 template for scio☆31Feb 3, 2026Updated last month
- A Scala compiler plugin to generate documentation from Scala source files.☆20Oct 18, 2021Updated 4 years ago
- A composable command-line parser for Scala.☆673Updated this week
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆29May 15, 2020Updated 5 years ago
- Markdown documentation☆249Updated this week
- Avro schema generation and serialization / deserialization for Scala☆728Jan 28, 2026Updated last month
- Distributed Prometheus time series database☆1,459Updated this week
- Type-level & seamless command-line argument parsing for Scala☆309Feb 20, 2026Updated last month
- Google BigQuery support for Spark, SQL, and DataFrames☆155Dec 14, 2019Updated 6 years ago
- Mixin classes and traits dynamically☆10Sep 4, 2017Updated 8 years ago
- ☆11Nov 15, 2016Updated 9 years ago
- Tools for rewriting and optimizing DAGs (directed-acyclic graphs) in Scala☆151Mar 20, 2022Updated 4 years ago
- friendly little parsers☆356Aug 19, 2024Updated last year
- A lightweight reactive RPC-like system built on Akka IO☆45Apr 23, 2015Updated 10 years ago