Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code
☆297Jan 31, 2025Updated last year
Alternatives and similar repositories for big-data-rosetta-code
Users that are interested in big-data-rosetta-code are comparing it to the libraries listed below
Sorting:
- A Scala API for Apache Beam and Google Cloud Dataflow.☆2,620Feb 27, 2026Updated 3 weeks ago
- A tool for data sampling, data generation, and data diffing☆346Jan 8, 2026Updated 2 months ago
- A Scala feature transformation library for data science and machine learning☆473Feb 7, 2025Updated last year
- Scio IDEA plugin☆30Oct 2, 2025Updated 5 months ago
- "The path to execution", Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes.☆270Jul 12, 2023Updated 2 years ago
- A Giter8 template for scio☆31Feb 3, 2026Updated last month
- Shapeless utilities for common data types☆67Mar 14, 2026Updated last week
- Building Scio from scratch step by step☆20May 20, 2019Updated 6 years ago
- A collection of Scio exercises inspired by Ruby Koans and many others.☆17Jun 24, 2021Updated 4 years ago
- ☆23Jan 3, 2025Updated last year
- Censorinus is a Scala *StatsD client with multiple personalities.☆21Apr 11, 2020Updated 5 years ago
- GCS support for avro-tools, parquet-tools and protobuf☆79May 5, 2025Updated 10 months ago
- Scala Aggregators used for ML Model metrics monitoring☆91Sep 13, 2023Updated 2 years ago
- Capturing meaningful metrics in your Java application☆67Jul 26, 2024Updated last year
- Abstract Algebra for Scala☆2,301Nov 21, 2025Updated 4 months ago
- A Scala compiler plugin to generate documentation from Scala source files.☆20Oct 18, 2021Updated 4 years ago
- Runs JVM closures in Docker containers on Kubernetes☆36Mar 23, 2018Updated 7 years ago
- A collection of Magnolia add-on modules☆182Feb 12, 2026Updated last month
- Scala extensions for the Kryo serialization library☆619Aug 19, 2024Updated last year
- Expressive types for Spark.☆896Updated this week
- Lightweight real-time big data streaming engine over Akka☆758Mar 1, 2022Updated 4 years ago
- A lightweight workflow definition library☆155Jul 15, 2022Updated 3 years ago
- ☆11Nov 15, 2016Updated 9 years ago
- A collection of Apache Parquet add-on modules☆30Mar 3, 2026Updated 2 weeks ago
- A lightweight reactive RPC-like system built on Akka IO☆45Apr 23, 2015Updated 10 years ago
- Minimal value types for Java☆78Feb 27, 2026Updated 3 weeks ago
- Minimal HTTP cache management library in Scala☆14Updated this week
- Easy, fast, transparent generic derivation of typeclass instances☆794Updated this week
- Code samples for the Lightbend tutorial on writing microservices with Akka Streams, Kafka Streams, and Kafka☆211May 30, 2019Updated 6 years ago
- Scala combinator library for working with binary data☆817Feb 2, 2026Updated last month
- Multi-project build tool, based on sbt.☆84Jan 21, 2023Updated 3 years ago
- A Scala API for Cascading☆3,522May 28, 2023Updated 2 years ago
- Reversible conversions between types☆656Nov 22, 2024Updated last year
- Storehaus is a library that makes it easy to work with asynchronous key value stores☆464Jul 17, 2020Updated 5 years ago
- INACTIVE: A daemon to transfer syslog messages to Apache Kafka.☆24Mar 30, 2017Updated 8 years ago
- Simple & Efficient data access for Scala and Scala.js☆497Jan 23, 2026Updated last month
- Qubole Sparklens tool for performance tuning Apache Spark☆590Jun 26, 2024Updated last year
- DBeam exports SQL tables into Avro files using JDBC and Apache Beam☆194Oct 28, 2025Updated 4 months ago
- Flexible law checking for Scala☆336Mar 2, 2026Updated 2 weeks ago