twitter/summingbird

Streaming MapReduce with Scalding and Storm

☆2,123

Alternatives and similar repositories for summingbird

Users that are interested in summingbird are comparing it to the libraries listed below.

twitter / scalding
A Scala API for Cascading
☆3,522 · May 28, 2023 · Updated 3 years ago
twitter / algebird
Abstract Algebra for Scala
☆2,299 · Nov 21, 2025 · Updated 8 months ago
twitter / storehaus
Storehaus is a library that makes it easy to work with asynchronous key value stores
☆465 · Jul 17, 2020 · Updated 6 years ago
twitter / tormenta
Scala extensions for Storm
☆133 · Jun 7, 2019 · Updated 7 years ago
nathanmarz / storm
Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more
☆8,772 · Aug 16, 2017 · Updated 8 years ago
twitter / bijection
Reversible conversions between types
☆656 · Nov 22, 2024 · Updated last year
twitter / elephant-bird
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
☆1,134 · Apr 10, 2023 · Updated 3 years ago
twitter-archive / ambrose
A platform for visualization and real-time monitoring of data workflows
☆1,170 · Jan 22, 2020 · Updated 6 years ago
yahoo / storm-yarn
Storm-yarn enables Storm clusters to be deployed into machines managed by Hadoop YARN.
☆418 · Jul 21, 2023 · Updated 3 years ago
twitter / cassovary
Cassovary is a simple big graph processing library for the JVM
☆1,053 · Oct 8, 2021 · Updated 4 years ago
amplab / shark
Development in Shark has been ended.
☆992 · Aug 11, 2015 · Updated 10 years ago
mesos / spark
Lightning-fast cluster computing in Java, Scala and Python.
☆1,419 · Apr 8, 2014 · Updated 12 years ago
twitter / chill
Scala extensions for the Kryo serialization library
☆618 · Aug 19, 2024 · Updated last year
apache / incubator-heron
Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter
☆3,629 · Mar 1, 2023 · Updated 3 years ago
apache / predictionio
PredictionIO, a machine learning server for developers and ML engineers.
☆12,520 · Jan 9, 2021 · Updated 5 years ago
adobe-research / spindle
Next-generation web analytics processing with Scala, Spark, and Parquet.
☆330 · Mar 28, 2015 · Updated 11 years ago
LinkedInAttic / camus
LinkedIn's previous generation Kafka to HDFS pipeline.
☆881 · Aug 27, 2020 · Updated 5 years ago
typelevel / spire
Powerful new number types and numeric abstractions for Scala.
☆1,772 · Updated this week
spray / spray
A suite of scala libraries for building and consuming RESTful web services on top of Akka: lightweight, asynchronous, non-blocking, actor…
☆2,493 · Feb 21, 2017 · Updated 9 years ago
YahooArchive / samoa
SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams.
☆427 · Mar 28, 2016 · Updated 10 years ago
twitter / scrooge
A Thrift parser/generator
☆794 · Apr 2, 2025 · Updated last year
twitter-archive / flockdb
A distributed, fault-tolerant graph database
☆3,317 · Mar 16, 2017 · Updated 9 years ago
mesos / chronos
Fault tolerant job scheduler for Mesos which handles dependencies and ISO8601 based schedules
☆4,376 · Jun 29, 2022 · Updated 4 years ago
LinkedInAttic / datafu
Hadoop library for large-scale data processing, now an Apache Incubator project
☆581 · Jul 8, 2014 · Updated 12 years ago
thinkaurelius / titan
Distributed Graph Database
☆5,229 · Oct 19, 2022 · Updated 3 years ago
addthis / stream-lib
Stream summarizer and cardinality estimator.
☆2,265 · Nov 28, 2019 · Updated 6 years ago
twitter-archive / kestrel
simple, distributed message queue system (inactive)
☆2,756 · Jan 22, 2016 · Updated 10 years ago
nathanmarz / elephantdb
Distributed database specialized in exporting key/value data from Hadoop
☆558 · Jun 27, 2014 · Updated 12 years ago
twitter-archive / ostrich
A stats collector & reporter for Scala servers (deprecated)
☆766 · Jun 6, 2019 · Updated 7 years ago
NICTA / scoobi
A Scala productivity framework for Hadoop.
☆479 · Jul 1, 2022 · Updated 4 years ago
nathanmarz / cascalog
Data processing on Hadoop without the hassle.
☆1,373 · May 18, 2023 · Updated 3 years ago
twitter / hraven
hRaven collects run time data and statistics from MapReduce jobs in an easily queryable format
☆129 · Jan 14, 2022 · Updated 4 years ago
eligosource / eventsourced
A library for building reliable, scalable and distributed event-sourced applications in Scala
☆825 · May 13, 2014 · Updated 12 years ago
nathanmarz / storm-starter
Learn to use Storm!
☆926 · Mar 9, 2016 · Updated 10 years ago
velvia / ScalaStorm
Harness the power and elegance of Scala with nathanmarz's Storm real-time system
☆247 · Sep 6, 2016 · Updated 9 years ago
Bridgewater / scala-notebook
Interactive Scala REPL in a browser
☆739 · May 18, 2022 · Updated 4 years ago
sameeragarwal / blinkdb
BlinkDB: Sub-Second Approximate Queries on Very Large Data.
☆660 · Feb 6, 2014 · Updated 12 years ago
scala / pickling
Fast, customizable, boilerplate-free pickling support for Scala
☆829 · Jun 6, 2017 · Updated 9 years ago
h2oai / h2o-2
Please visit https://github.com/h2oai/h2o-3 for latest H2O
☆2,254 · Oct 24, 2024 · Updated last year