Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code
☆296Jan 31, 2025Updated last year
Alternatives and similar repositories for big-data-rosetta-code
Users that are interested in big-data-rosetta-code are comparing it to the libraries listed below
Sorting:
- A tool for data sampling, data generation, and data diffing☆345Jan 8, 2026Updated last month
- A Scala API for Apache Beam and Google Cloud Dataflow.☆2,615Feb 12, 2026Updated 2 weeks ago
- A Scala feature transformation library for data science and machine learning☆474Feb 7, 2025Updated last year
- A Giter8 template for scio☆31Feb 3, 2026Updated 3 weeks ago
- "The path to execution", Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes.☆269Jul 12, 2023Updated 2 years ago
- Scio IDEA plugin☆30Oct 2, 2025Updated 4 months ago
- Building Scio from scratch step by step☆20May 20, 2019Updated 6 years ago
- ☆23Jan 3, 2025Updated last year
- Shapeless utilities for common data types☆67Feb 12, 2026Updated 2 weeks ago
- Scala Aggregators used for ML Model metrics monitoring☆91Sep 13, 2023Updated 2 years ago
- A collection of Magnolia add-on modules☆181Feb 12, 2026Updated 2 weeks ago
- Scala extensions for the Kryo serialization library☆619Aug 19, 2024Updated last year
- ☆11Nov 15, 2016Updated 9 years ago
- Abstract Algebra for Scala☆2,302Nov 21, 2025Updated 3 months ago
- Expressive types for Spark.☆896Feb 22, 2026Updated last week
- Minimal HTTP cache management library in Scala☆14Updated this week
- Lightweight real-time big data streaming engine over Akka☆759Mar 1, 2022Updated 4 years ago
- A collection of Apache Parquet add-on modules☆30Feb 12, 2026Updated 2 weeks ago
- GCS support for avro-tools, parquet-tools and protobuf☆79May 5, 2025Updated 9 months ago
- A lightweight reactive RPC-like system built on Akka IO☆45Apr 23, 2015Updated 10 years ago
- Simple & Efficient data access for Scala and Scala.js☆497Jan 23, 2026Updated last month
- A Java library for managing child processes.☆18Dec 8, 2015Updated 10 years ago
- A lightweight workflow definition library☆155Jul 15, 2022Updated 3 years ago
- A fast, streaming-friendly, type-safe, pure-Scala MessagePack library. Supercharge your microservices today!☆61Jun 6, 2021Updated 4 years ago
- Utilities for Akka cluster in production☆100Oct 31, 2017Updated 8 years ago
- A Scala compiler plugin to generate documentation from Scala source files.☆20Oct 18, 2021Updated 4 years ago
- Easy, fast, transparent generic derivation of typeclass instances☆795Feb 18, 2026Updated last week
- Reversible conversions between types☆657Nov 22, 2024Updated last year
- A helping hand for generating sensible data with ScalaCheck☆119Sep 15, 2025Updated 5 months ago
- type-class based data cleansing library for Apache Spark SQL☆78Jun 23, 2019Updated 6 years ago
- Akka extensions for exploiting locality of clustered actors☆10Nov 14, 2019Updated 6 years ago
- Minimal value types for Java☆77Updated this week
- Storehaus is a library that makes it easy to work with asynchronous key value stores☆464Jul 17, 2020Updated 5 years ago
- Code samples for the Lightbend tutorial on writing microservices with Akka Streams, Kafka Streams, and Kafka☆211May 30, 2019Updated 6 years ago
- Flexible law checking for Scala☆334Updated this week
- Qubole Sparklens tool for performance tuning Apache Spark☆590Jun 26, 2024Updated last year
- Multi-project build tool, based on sbt.☆84Jan 21, 2023Updated 3 years ago
- Runs JVM closures in Docker containers on Kubernetes☆36Mar 23, 2018Updated 7 years ago
- Scala combinator library for working with binary data☆816Feb 2, 2026Updated 3 weeks ago