andrewpalumbo / mahout-samsara-bookLinks
Accompanying code examples for Apache Mahout: Beyond MapReduce. Distributed Algorithm Design.
☆11Updated last year
Alternatives and similar repositories for mahout-samsara-book
Users that are interested in mahout-samsara-book are comparing it to the libraries listed below
Sorting:
- Exploration Library in Java☆12Updated last year
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 9 years ago
- Machine Learning for Cascading☆82Updated 9 years ago
- Distributed Matrix Library☆72Updated 8 years ago
- Java implementation of the Microsoft's AdPredictor algorithm☆17Updated 7 years ago
- Spark MLlib code optimized to efficiently support sparse data☆51Updated 8 years ago
- Chronicle memory-mapped message journal in Python☆14Updated 4 years ago
- Parallel Iterative Algorithm (SGD) on Hadoop's YARN framework☆42Updated 12 years ago
- Tweet Analysis with Spark☆15Updated 7 years ago
- [Deprecated]: Exploration library☆16Updated last year
- Splash Project for parallel stochastic learning☆94Updated 7 years ago
- Additional useful algorithms that can be used with spark.☆24Updated 10 years ago
- GPU Acceleration for Apache Spark☆34Updated 9 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆93Updated 9 years ago
- Cascading and Scalding wrapper for HBase with advanced read features☆54Updated 5 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Updated 9 years ago
- Java implementation of Thompson sampling to solve the multi-armed bandit problem☆29Updated last year
- open source version of the Bonsai library☆26Updated 9 years ago
- Training materials for Strata, AMP Camp, etc☆149Updated 9 years ago
- Testing framework for Collaborative Filtering☆38Updated 10 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 8 years ago
- ElasticSearch Prediction Generator and Plugin☆22Updated 9 years ago
- A simple demonstration of sub-sequence sampling as used for anomaly detection with EKG signals☆102Updated 4 years ago
- A demo of how to use PageRank with Hadoop and SociaLite to identify anomalies in Healthcare Data☆47Updated 9 years ago
- Project for the talk on NLP using LSTM implementation from DL4J on Spark☆20Updated 9 years ago
- Another, hopefully better, implementation of ALS on Spark☆14Updated 10 years ago
- PMML evaluator library for the Apache Hive data warehouse software (legacy codebase)☆13Updated 10 years ago
- Python functions for popular relevance metrics (ndcg, err, etc)☆16Updated last year
- ADMM based large scale logistic regression☆337Updated last year
- ☆24Updated 9 years ago