twitter / summingbird
Streaming MapReduce with Scalding and Storm
☆2,134 · Updated 3 years ago
Alternatives and similar repositories for summingbird:
Users who are interested in summingbird are comparing it to the libraries listed below.
- A Scala API for Cascading ☆3,515 · Updated last year
- Abstract Algebra for Scala ☆2,296 · Updated 8 months ago
- A Thrift parser/generator ☆797 · Updated last month
- Reversible conversions between types ☆657 · Updated 5 months ago
- Lightweight real-time big data streaming engine over Akka ☆762 · Updated 3 years ago
- I/O and Microservice library for Scala ☆1,139 · Updated 3 years ago
- Distributed Prometheus time series database ☆1,440 · Updated last week
- Simplifying robust end-to-end machine learning on Apache Spark. ☆470 · Updated 8 years ago
- Fast, testable, Scala services built on TwitterServer and Finagle ☆2,270 · Updated 2 weeks ago
- LinkedIn's previous generation Kafka to HDFS pipeline. ☆876 · Updated 4 years ago
- Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code. ☆1,137 · Updated 2 years ago
- Distributed NoSQL Database ☆513 · Updated 6 years ago
- Cassovary is a simple big graph processing library for the JVM ☆1,048 · Updated 3 years ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in… ☆1,037 · Updated 2 years ago
- Wonderful reusable code from Twitter ☆2,717 · Updated last week
- [PROJECT IS NO LONGER MAINTAINED] Code examples that show how to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streamin… ☆725 · Updated 3 years ago
- Mirror of Apache Samza ☆824 · Updated this week
- Scala extensions for the Kryo serialization library ☆616 · Updated 8 months ago
- Scala School 2 ☆343 · Updated 3 years ago
- Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover… ☆516 · Updated 5 years ago
- Storehaus is a library that makes it easy to work with asynchronous key value stores ☆466 · Updated 4 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere ☆1,007 · Updated 2 years ago
- DEPRECATED. Zeppelin has moved to Apache. Please make pull request there ☆410 · Updated 7 years ago
- Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines. ☆857 · Updated 4 years ago
- A suite of scala libraries for building and consuming RESTful web services on top of Akka: lightweight, asynchronous, non-blocking, actor… ☆2,501 · Updated 8 years ago
- Quick up and running using Scala for Apache Kafka ☆330 · Updated 7 years ago
- A repository of information, examples and good practices around the Lambda Architecture ☆369 · Updated 7 years ago
- A scala library for connecting to a redis server, or a cluster of redis nodes using consistent hashing on the client side. ☆1,021 · Updated 2 years ago
- Twitter's Effective Scala Guide ☆2,243 · Updated 2 years ago
- Alpakka Kafka connector - Alpakka is a Reactive Enterprise Integration library for Java and Scala, based on Reactive Streams and Akka. ☆1,416 · Updated this week