spotify/ratatool

A tool for data sampling, data generation, and data diffing

☆349

Alternatives and similar repositories for ratatool

Users who are interested in ratatool often compare it to the libraries listed below.

spotify / featran
A Scala feature transformation library for data science and machine learning
☆475 · Feb 7, 2025 · Updated last year
spotify / noether
Scala Aggregators used for ML Model metrics monitoring
☆93 · Sep 13, 2023 · Updated 2 years ago
spotify / scio
A Scala API for Apache Beam and Google Cloud Dataflow.
☆2,626 · Jul 14, 2026 · Updated last week
spotify / elitzur
☆23 · Jan 3, 2025 · Updated last year
spotify / big-data-rosetta-code
Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code
☆297 · Jan 31, 2025 · Updated last year
nevillelyh / shapeless-datatype
Shapeless utilities for common data types
☆67 · Jul 2, 2026 · Updated 2 weeks ago
spotify / scio-idea-plugin
Scio IDEA plugin
☆30 · Oct 2, 2025 · Updated 9 months ago
spotify / gcs-tools
GCS support for avro-tools, parquet-tools and protobuf
☆79 · Jul 14, 2026 · Updated last week
spotify / hype
Runs JVM closures in Docker containers on Kubernetes
☆38 · Mar 23, 2018 · Updated 8 years ago
spotify / magnolify
A collection of Magnolia add-on modules
☆180 · Updated this week
spotify / styx
"The path to execution", Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes.
☆271 · Jul 12, 2023 · Updated 3 years ago
sbt / contraband
http://www.scala-sbt.org/contraband/
☆71 · Jul 10, 2026 · Updated last week
twitter / algebird
Abstract Algebra for Scala
☆2,299 · Nov 21, 2025 · Updated 8 months ago
spotify / flo
A lightweight workflow definition library
☆156 · Jul 15, 2022 · Updated 4 years ago
spotify / zoltar
Common library for serving TensorFlow, XGBoost and scikit-learn models in production.
☆143 · Sep 11, 2023 · Updated 2 years ago
spotify / spydra
Ephemeral Hadoop clusters using Google Compute Platform
☆136 · Mar 31, 2022 · Updated 4 years ago
stanch / zipper
An implementation of Huet’s Zipper for Scala and Scala.js that is intended to be usable in many common scenarios
☆49 · Aug 18, 2024 · Updated last year
typelevel / frameless
Expressive types for Spark.
☆898 · Updated this week
nevillelyh / protobuf-generic
Generic protobuf manipulation
☆38 · Updated this week
runarorama / scala-mset
Multisets for Scala
☆87 · Jul 23, 2021 · Updated 4 years ago
functional-streams-for-scala / fs2-scalaz
Interop between fs2 and scalaz
☆14 · Feb 9, 2018 · Updated 8 years ago
scodec / scodec-stream
Binding between scodec and FS2
☆54 · Oct 23, 2021 · Updated 4 years ago
hedgehogqa / scala-hedgehog
Release with confidence, state-of-the-art property testing for Scala.
☆268 · Jun 29, 2026 · Updated 3 weeks ago
krasserm / streamz
A combinator library for integrating Functional Streams for Scala (FS2), Akka Streams and Apache Camel
☆280 · Sep 3, 2024 · Updated last year
spotify / scio.g8
A Giter8 template for scio
☆31 · Feb 3, 2026 · Updated 5 months ago
criteo / socco
A Scala compiler plugin to generate documentation from Scala source files.
☆20 · Oct 18, 2021 · Updated 4 years ago
nevillelyh / parquet-extra
A collection of Apache Parquet add-on modules
☆31 · Updated this week
bkirwi / decline
A composable command-line parser for Scala.
☆679 · Apr 8, 2026 · Updated 3 months ago
sksamuel / avro4s
Avro schema generation and serialization / deserialization for Scala
☆730 · May 22, 2026 · Updated last month
lightbend / paradox
Markdown documentation
☆251 · Updated this week
ThoughtWorksInc / Constructor.scala
Mixin classes and traits dynamically
☆10 · Sep 4, 2017 · Updated 8 years ago
filodb / FiloDB
Distributed Prometheus time series database
☆1,468 · Updated this week
spotify / hornet
☆10 · Nov 15, 2016 · Updated 9 years ago
lightbend / kafka-streams-query
Library offering http based query on top of Kafka Streams Interactive Queries
☆70 · Mar 24, 2023 · Updated 3 years ago
tpolecat / atto
friendly little parsers
☆356 · Aug 19, 2024 · Updated last year
indix / schemer
Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
☆116 · Mar 5, 2020 · Updated 6 years ago
alexarchambault / scalacheck-shapeless
Generation of arbitrary case classes / ADTs instances with scalacheck and shapeless
☆237 · Aug 12, 2024 · Updated last year
alexarchambault / case-app
Type-level & seamless command-line argument parsing for Scala
☆311 · Jun 22, 2026 · Updated 3 weeks ago
47degrees / case-classy
configuration with less hassle
☆69 · May 7, 2019 · Updated 7 years ago