spotify / scio
A Scala API for Apache Beam and Google Cloud Dataflow.
☆2,583Updated last week
Alternatives and similar repositories for scio:
Users that are interested in scio are comparing it to the libraries listed below
- Abstract Algebra for Scala☆2,296Updated 7 months ago
- Base classes to use when writing tests with Spark☆1,525Updated 2 months ago
- A Scala feature transformation library for data science and machine learning☆466Updated last month
- A tool for data sampling, data generation, and data diffing☆341Updated last week
- Essential Spark extensions and helper methods ✨😲☆758Updated 4 months ago
- Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code☆291Updated last month
- Distributed Prometheus time series database☆1,433Updated this week
- Yet another JSON library for Scala☆2,502Updated last week
- Expressive types for Spark.☆884Updated 2 weeks ago
- A Scala API for Cascading☆3,513Updated last year
- The missing MatPlotLib for Scala + Spark☆728Updated 3 years ago
- Streaming MapReduce with Scalding and Storm☆2,136Updated 3 years ago
- Protocol buffer compiler for Scala.☆1,312Updated this week
- JSON library☆1,484Updated this week
- A Scala kernel for Jupyter☆1,610Updated last week
- Iceberg is a table format for large, slow-moving tabular data☆479Updated last year
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)☆441Updated this week
- Compile-time Language Integrated Queries for Scala☆2,153Updated this week
- "The path to execution", Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes.☆267Updated last year
- Qubole Sparklens tool for performance tuning Apache Spark☆572Updated 8 months ago
- Pure Scala Artifact Fetching☆2,076Updated this week
- Alpakka is a Reactive Enterprise Integration library for Java and Scala, based on Reactive Streams and Akka.☆1,263Updated last month
- Alpakka Kafka connector - Alpakka is a Reactive Enterprise Integration library for Java and Scala, based on Reactive Streams and Akka.☆1,416Updated 3 months ago
- DataStax Connector for Apache Spark to Apache Cassandra☆1,944Updated this week
- Scala combinator library for building Finagle HTTP services☆1,603Updated this week
- A collection of Scala best practices☆4,383Updated 2 years ago
- Secor is a service implementing Kafka log persistence☆1,843Updated 3 months ago
- The easy way to learn Scala.☆2,634Updated last year
- Scala Scripting☆2,616Updated last week
- A free tutorial for Apache Spark.☆987Updated 4 years ago