spotify / scio
A Scala API for Apache Beam and Google Cloud Dataflow.
☆2,590Updated last week
Alternatives and similar repositories for scio:
Users that are interested in scio are comparing it to the libraries listed below
- A Scala feature transformation library for data science and machine learning☆467Updated 2 months ago
- Abstract Algebra for Scala☆2,295Updated 8 months ago
- A Scala API for Cascading☆3,515Updated last year
- Base classes to use when writing tests with Spark☆1,528Updated 3 months ago
- Fast, testable, Scala services built on TwitterServer and Finagle☆2,268Updated this week
- Scala combinator library for building Finagle HTTP services☆1,603Updated this week
- Expressive types for Spark.☆884Updated last week
- A tool for data sampling, data generation, and data diffing☆342Updated 3 weeks ago
- Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code☆291Updated 2 months ago
- Yet another JSON library for Scala☆2,508Updated this week
- Streaming MapReduce with Scalding and Storm☆2,134Updated 3 years ago
- Essential Spark extensions and helper methods ✨😲☆759Updated 6 months ago
- Protocol buffer compiler for Scala.☆1,313Updated this week
- The missing MatPlotLib for Scala + Spark☆728Updated 3 years ago
- The Internals of Apache Spark☆1,497Updated 7 months ago
- Alpakka is a Reactive Enterprise Integration library for Java and Scala, based on Reactive Streams and Akka.☆1,264Updated last week
- command line options parsing for Scala☆1,436Updated last year
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.☆3,406Updated last week
- Pure Scala Artifact Fetching☆2,081Updated this week
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)☆443Updated 2 weeks ago
- Experimental Scala compiler focused on compilation speed☆1,238Updated 3 years ago
- Scala Scripting☆2,620Updated this week
- "The path to execution", Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes.☆267Updated last year
- Deploy über-JARs. Restart processes. (port of codahale/assembly-sbt)☆1,953Updated last month
- Qubole Sparklens tool for performance tuning Apache Spark☆575Updated 10 months ago
- A Scala kernel for Jupyter☆1,615Updated last month
- Generic programming for Scala☆3,399Updated this week
- a command line tool to apply templates defined on GitHub☆1,746Updated last week
- Code formatter for Scala☆1,463Updated this week
- Principled Functional Programming in Scala☆4,667Updated this week