ankurdave / kmeans-spark
A simple implementation of k-means clustering on the Spark cluster computing framework. See http://cs.berkeley.edu/~matei/spark.
☆27Updated 13 years ago
Alternatives and similar repositories for kmeans-spark:
Users that are interested in kmeans-spark are comparing it to the libraries listed below
- Locality Sensitive Hashing for Apache Spark☆195Updated 8 years ago
- Training materials for Strata, AMP Camp, etc☆150Updated 9 years ago
- ☆110Updated 7 years ago
- Locality Sensitive Hashing for Apache Spark☆88Updated 2 years ago
- Simple Spark example of generating table stats for use of data quality checks☆28Updated 7 years ago
- Topic Modeling on Apache Spark☆94Updated 5 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- An implementation of DBSCAN runing on top of Apache Spark☆183Updated 7 years ago
- An example of using Avro and Parquet in Spark SQL☆60Updated 9 years ago
- Visualize streaming machine learning in Spark☆176Updated 7 years ago
- Code for Packt Publishing's Scala Data Analysis Cookbook.☆49Updated 9 years ago
- PMML evaluator library for the Apache Spark cluster computing system (http://spark.apache.org/)☆94Updated 2 years ago
- An implementation of Markov Clustering algorithm for Spark in Scala☆34Updated 7 years ago
- HDP Data Science/Machine Learning demo☆37Updated 9 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆147Updated 9 years ago
- Approximate Nearest Neighbors in Spark☆174Updated 3 years ago
- Distributed Streaming Matrix Factorization implemented on Spark for Recommendation Systems☆106Updated 8 years ago
- SparklingGraph provides easy to use set of features that will give you ability to proces large scala graphs using Spark and GraphX.☆152Updated 4 years ago
- Large-scale ML & graph analytics on Giraph☆79Updated 9 years ago
- Implementation of the Apriori algorithm using Spark.☆38Updated 10 years ago
- An Apache Spark-shell backend for IPython☆105Updated 3 years ago
- Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References☆69Updated 6 years ago
- A Distributed Matrix Operations Library Built on Top of Spark☆106Updated 8 years ago
- Additional useful algorithms that can be used with spark.☆24Updated 10 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Updated 11 years ago
- Structured Streaming Machine Learning example with Spark 2.0☆92Updated 7 years ago
- An efficient updatable key-value store for Apache Spark☆251Updated 7 years ago
- Pig on Apache Spark☆83Updated 9 years ago
- Simple Spark app that reads and writes Avro data☆31Updated 9 years ago
- Glint: High performance scala parameter server☆168Updated 6 years ago