ankurdave / kmeans-spark
A simple implementation of k-means clustering on the Spark cluster computing framework. See http://cs.berkeley.edu/~matei/spark.
☆27Updated 13 years ago
Alternatives and similar repositories for kmeans-spark:
Users that are interested in kmeans-spark are comparing it to the libraries listed below
- A curated inventory of machine learning methods available on the Apache Spark platform, both in official and third party libraries.☆65Updated 7 years ago
- Training materials for Strata, AMP Camp, etc☆150Updated 9 years ago
- An implementation of Markov Clustering algorithm for Spark in Scala☆34Updated 7 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆16Updated 5 years ago
- Additional useful algorithms that can be used with spark.☆24Updated 10 years ago
- A Distributed Matrix Operations Library Built on Top of Spark☆106Updated 8 years ago
- Factorization Machines on Spark and Glint☆25Updated 8 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- Distributed Streaming Matrix Factorization implemented on Spark for Recommendation Systems☆106Updated 8 years ago
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 9 years ago
- Simple Spark example of generating table stats for use of data quality checks☆28Updated 7 years ago
- Locality Sensitive Hashing for Apache Spark☆195Updated 8 years ago
- Code for Packt Publishing's Scala Data Analysis Cookbook.☆49Updated 9 years ago
- ☆33Updated 9 years ago
- Fast-Data-Processing-with-Spark-2☆22Updated 2 years ago
- This is an introduction of Apache Spark DataFrames.☆41Updated 9 years ago
- Some Spark implementations of clustering algorithms.☆19Updated 6 years ago
- Implementation of the Apriori algorithm using Spark.☆38Updated 10 years ago
- GPU Acceleration for Apache Spark☆34Updated 9 years ago
- Coursera Machine Learning class examples in Spark☆43Updated 11 years ago
- An Apache Spark-shell backend for IPython☆105Updated 3 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆147Updated 9 years ago
- Joins for skewed datasets in Spark☆57Updated 7 years ago
- Locality Sensitive Hashing for Apache Spark☆87Updated 3 years ago
- Anomaly Detection model uses Spark for training and Spark Streaming for testing☆67Updated 9 years ago
- An implementation of DBSCAN runing on top of Apache Spark☆183Updated 7 years ago
- Structured Streaming Machine Learning example with Spark 2.0☆92Updated 7 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆66Updated 9 years ago
- Spark MLlib code optimized to efficiently support sparse data☆51Updated 8 years ago
- Example of running a Genetic Algorithm (Travelling Salesman) on Apache Spark☆43Updated 8 years ago