spotify / scio
A Scala API for Apache Beam and Google Cloud Dataflow.
☆2,578Updated this week
Alternatives and similar repositories for scio:
Users that are interested in scio are comparing it to the libraries listed below
- A Scala API for Cascading☆3,513Updated last year
- A Scala feature transformation library for data science and machine learning☆466Updated last week
- Abstract Algebra for Scala☆2,295Updated 6 months ago
- Base classes to use when writing tests with Spark☆1,526Updated last month
- A tool for data sampling, data generation, and data diffing☆341Updated last week
- Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code☆291Updated 2 weeks ago
- JSON library☆1,486Updated this week
- Expressive types for Spark.☆883Updated 2 weeks ago
- Streaming MapReduce with Scalding and Storm☆2,136Updated 3 years ago
- Alpakka Kafka connector - Alpakka is a Reactive Enterprise Integration library for Java and Scala, based on Reactive Streams and Akka.☆1,417Updated 2 months ago
- Protocol buffer compiler for Scala.☆1,311Updated this week
- The missing MatPlotLib for Scala + Spark☆729Updated 3 years ago
- Iceberg is a table format for large, slow-moving tabular data☆480Updated last year
- Scala GraphQL implementation☆1,960Updated 3 weeks ago
- KillrWeather is a reference application (work in progress) showing how to easily integrate streaming and batch data processing with Apach…☆1,183Updated 8 years ago
- 🔍 Elasticsearch Scala Client - Reactive, Non Blocking, Type Safe, HTTP Client☆1,637Updated this week
- Yet another JSON library for Scala☆2,501Updated 3 weeks ago
- Alpakka is a Reactive Enterprise Integration library for Java and Scala, based on Reactive Streams and Akka.☆1,263Updated 3 weeks ago
- Apache Beam is a unified programming model for Batch and Streaming data processing.☆8,003Updated this week
- Generic programming for Scala☆3,398Updated 3 weeks ago
- REST job server for Apache Spark☆2,835Updated last month
- Distributed Prometheus time series database☆1,432Updated this week
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)☆439Updated 2 months ago
- DataStax Connector for Apache Spark to Apache Cassandra☆1,944Updated last month
- Fast, testable, Scala services built on TwitterServer and Finagle☆2,270Updated 9 months ago
- Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.☆854Updated 4 years ago
- The Internals of Apache Spark☆1,490Updated 5 months ago
- Scala Scripting☆2,611Updated this week
- Pure Scala Artifact Fetching☆2,072Updated this week
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,006Updated 2 years ago