andrewpalumbo / mahout-samsara-book
Accompanying code examples for Apache Mahout: Beyond MapReduce. Distributed Algorithm Design.
☆10Updated last year
Alternatives and similar repositories for mahout-samsara-book:
Users that are interested in mahout-samsara-book are comparing it to the libraries listed below
- Exploration Library in Java☆12Updated last year
- GPU Acceleration for Apache Spark☆34Updated 9 years ago
- Additional useful algorithms that can be used with spark.☆24Updated 10 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆92Updated 8 years ago
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 8 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆21Updated 8 years ago
- Seldon Spark Jobs☆26Updated 9 years ago
- Using deep learning to POS tag sentences using scala + DL4J☆37Updated 9 years ago
- Distributed Matrix Library☆70Updated 7 years ago
- Machine Learning for Cascading☆82Updated 9 years ago
- Example of using HDInsight (Storm) to read events from Event Hub, write events to HBase, and visualize events using Socket.IO and D3.js☆15Updated 3 years ago
- Scala/Spark implementation of Distributed Nearest Neighbours Mean Shift using LSH☆30Updated 5 years ago
- Spark MLlib code optimized to efficiently support sparse data☆50Updated 8 years ago
- The presentation at Spark Summit 2014 showing how 4Quant does production scale image processing and analysis using Spark☆17Updated 10 years ago
- Distributed lbfgs on Apache Spark☆10Updated 4 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Updated 11 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 10 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Updated 9 years ago
- Code to allow running BIDMach on Spark including HDFS integration and lightweight sparse model updates (Kylix).☆15Updated 4 years ago
- Scriptable scheduler for periodical Hadoop workflows☆22Updated 6 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Updated 8 years ago
- Spark library for doing exploratory data analysis in a scalable way☆43Updated 9 years ago
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆62Updated 8 years ago
- open source version of the Bonsai library☆26Updated 8 years ago
- Java implementation of the Microsoft's AdPredictor algorithm☆17Updated 6 years ago
- Vowpal Wabbit Webservice. A web service that accepts VW formatted text and runs it through a VW daemon instance.☆40Updated 8 years ago
- Sparse feature extraction with Spark☆29Updated 6 years ago