andrewpalumbo / mahout-samsara-bookLinks

Accompanying code examples for Apache Mahout: Beyond MapReduce. Distributed Algorithm Design.

☆11

Alternatives and similar repositories for mahout-samsara-book

Users that are interested in mahout-samsara-book are comparing it to the libraries listed below

Sorting:

microsoft / mwt-ds-explore-java
Exploration Library in Java
☆12Updated last year
intel-spark / StatisticsOnSpark
Assembly of fundamental statistics implemented based on Apache Spark
☆31Updated 9 years ago
Cascading / pattern
Machine Learning for Cascading
☆82Updated 9 years ago
amplab / ml-matrix
Distributed Matrix Library
☆72Updated 8 years ago
mayconbordin / adpredictor-java
Java implementation of the Microsoft's AdPredictor algorithm
☆17Updated 7 years ago
intel-spark / SparseML
Spark MLlib code optimized to efficiently support sparse data
☆51Updated 8 years ago
jontuk / pychro
Chronicle memory-mapped message journal in Python
☆14Updated 4 years ago
jpatanooga / KnittingBoar
Parallel Iterative Algorithm (SGD) on Hadoop's YARN framework
☆42Updated 12 years ago
lintool / TweetAnalysisWithSpark
Tweet Analysis with Spark
☆15Updated 7 years ago
microsoft / mwt-ds-explore
[Deprecated]: Exploration library
☆16Updated last year
zhangyuc / splash
Splash Project for parallel stochastic learning
☆94Updated 7 years ago
AtlasPilotPuppy / SparkAlgorithms
Additional useful algorithms that can be used with spark.
☆24Updated 10 years ago
adobe-research / spark-gpu
GPU Acceleration for Apache Spark
☆34Updated 9 years ago
Sotera / correlation-approximation
Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets
☆93Updated 9 years ago
ParallelAI / SpyGlass
Cascading and Scalding wrapper for HBase with advanced read features
☆54Updated 5 years ago
collectivemedia / spark-hyperloglog
Interactive Audience Analytics with Spark and HyperLogLog
☆55Updated 9 years ago
wealthfront / thompson-sampling
Java implementation of Thompson sampling to solve the multi-armed bandit problem
☆29Updated last year
eBay / bonsai
open source version of the Bonsai library
☆26Updated 9 years ago
amplab / training
Training materials for Strata, AMP Camp, etc
☆149Updated 9 years ago
grafos-ml / test.fm
Testing framework for Collaborative Filtering
☆38Updated 10 years ago
memsql / streamliner-starter
Starter project for building MemSQL Streamliner Pipelines
☆32Updated 8 years ago
sdhu / elasticsearch-prediction
ElasticSearch Prediction Generator and Plugin
☆22Updated 9 years ago
tdunning / anomaly-detection
A simple demonstration of sub-sequence sampling as used for anomaly detection with EKG signals
☆102Updated 4 years ago
ofermend / medicare-demo
A demo of how to use PageRank with Hadoop and SociaLite to identify anomalies in Healthcare Data
☆47Updated 9 years ago
kensuio-oss / NLP-LSTM-Spark
Project for the talk on NLP using LSTM implementation from DL4J on Spark
☆20Updated 9 years ago
mengxr / spark-als
Another, hopefully better, implementation of ALS on Spark
☆14Updated 10 years ago
jpmml / jpmml-hive
PMML evaluator library for the Apache Hive data warehouse software (legacy codebase)
☆13Updated 10 years ago
o19s / search-metrics
Python functions for popular relevance metrics (ndcg, err, etc)
☆16Updated last year
linkedin / ml-ease
ADMM based large scale logistic regression
☆337Updated last year
lucidworks / solr-for-datascience
☆24Updated 9 years ago