YahooArchive/samoa

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/YahooArchive/samoa)

YahooArchive / samoa

SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams.

☆427

Alternatives and similar repositories for samoa

Users that are interested in samoa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

apache / incubator-samoa
View on GitHub
Mirror of Apache Samoa (Incubating)
☆251May 15, 2026Updated 2 months ago
twitter / summingbird
View on GitHub
Streaming MapReduce with Scalding and Storm
☆2,123Jan 19, 2022Updated 4 years ago
pmerienne / trident-ml
View on GitHub
Trident-ML : A realtime online machine learning library
☆384Dec 16, 2023Updated 2 years ago
yahoo / storm-yarn
View on GitHub
Storm-yarn enables Storm clusters to be deployed into machines managed by Hadoop YARN.
☆418Jul 21, 2023Updated 3 years ago
stratosphere / stratosphere
View on GitHub
Stratosphere is now Apache Flink.
☆201Dec 16, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
amplab / shark
View on GitHub
Development in Shark has been ended.
☆992Aug 11, 2015Updated 10 years ago
myui / hivemall
View on GitHub
Scalable machine learning library for Apache Hive/Spark/Pig
☆501Dec 2, 2016Updated 9 years ago
walmartlabs / mupd8
View on GitHub
Muppet
☆128May 7, 2021Updated 5 years ago
jubatus / jubatus
View on GitHub
Framework and Library for Distributed Online Machine Learning
☆708May 16, 2019Updated 7 years ago
LinkedInAttic / datafu
View on GitHub
Hadoop library for large-scale data processing, now an Apache Incubator project
☆581Jul 8, 2014Updated 12 years ago
adobe-research / spindle
View on GitHub
Next-generation web analytics processing with Scala, Spark, and Parquet.
☆330Mar 28, 2015Updated 11 years ago
Netflix / suro
View on GitHub
Netflix's distributed Data Pipeline
☆796Apr 10, 2023Updated 3 years ago
addthis / stream-lib
View on GitHub
Stream summarizer and cardinality estimator.
☆2,265Nov 28, 2019Updated 6 years ago
amplab / MLI
View on GitHub
An API for Distributed Machine Learning
☆156Sep 22, 2016Updated 9 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
nathanmarz / storm
View on GitHub
Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more
☆8,771Aug 16, 2017Updated 8 years ago
linkedin / ml-ease
View on GitHub
ADMM based large scale logistic regression
☆340Dec 16, 2023Updated 2 years ago
forcedotcom / phoenix
View on GitHub
☆559Feb 12, 2022Updated 4 years ago
h2oai / h2o-2
View on GitHub
Please visit https://github.com/h2oai/h2o-3 for latest H2O
☆2,254Oct 24, 2024Updated last year
quintona / storm-pattern
View on GitHub
A fork of cascading patterns, but implemented for trident
☆71Dec 16, 2023Updated 2 years ago
twitter-archive / ambrose
View on GitHub
A platform for visualization and real-time monitoring of data workflows
☆1,170Jan 22, 2020Updated 6 years ago
guoding83128 / OpenDL
View on GitHub
The Deep Learning training framework on Spark
☆221May 3, 2025Updated last year
flaxsearch / luwak
View on GitHub
A java library for stored queries
☆381Mar 8, 2023Updated 3 years ago
nathanmarz / elephantdb
View on GitHub
Distributed database specialized in exporting key/value data from Hadoop
☆558Jun 27, 2014Updated 12 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
twitter / scalding
View on GitHub
A Scala API for Cascading
☆3,523May 28, 2023Updated 3 years ago
twitter / algebird
View on GitHub
Abstract Algebra for Scala
☆2,299Nov 21, 2025Updated 8 months ago
amplab / keystone
View on GitHub
Simplifying robust end-to-end machine learning on Apache Spark.
☆473Apr 18, 2017Updated 9 years ago
madlib / archived_madlib
View on GitHub
MADlib has moved to Apache MADlib (incubating). Please send pull requests to the Apache repository.
☆508Feb 9, 2018Updated 8 years ago
twitter / tormenta
View on GitHub
Scala extensions for Storm
☆133Jun 7, 2019Updated 7 years ago
LinkedInAttic / camus
View on GitHub
LinkedIn's previous generation Kafka to HDFS pipeline.
☆881Aug 27, 2020Updated 5 years ago
cdapio / tephra
View on GitHub
Apache Tephra: Transactions for HBase.
☆159Sep 13, 2024Updated last year
GraphChi / graphchiDB-scala
View on GitHub
*Experimental* GraphChi-DB graph database with computational capabilities
☆79Oct 7, 2015Updated 10 years ago
superconductor-lang / superconductor
View on GitHub
Big data visualization on the web
☆363Jul 24, 2014Updated 12 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
hmsonline / storm-cassandra
View on GitHub
Storm Cassandra Integration
☆183Dec 17, 2023Updated 2 years ago
Cascading / CoPA
View on GitHub
Cascading plus City of Palo Alto open data
☆29Mar 3, 2013Updated 13 years ago
tomdz / storm-esper
View on GitHub
Storm - Esper integration experiment
☆199Feb 11, 2020Updated 6 years ago
pulsarIO / realtime-analytics
View on GitHub
Realtime analytics, this includes the core components of Pulsar pipeline.
☆650Nov 6, 2015Updated 10 years ago
edwardcapriolo / IronCount
View on GitHub
☆34Jan 13, 2019Updated 7 years ago
mesos / spark
View on GitHub
Lightning-fast cluster computing in Java, Scala and Python.
☆1,419Apr 8, 2014Updated 12 years ago
twitter / storehaus
View on GitHub
Storehaus is a library that makes it easy to work with asynchronous key value stores
☆465Jul 17, 2020Updated 6 years ago