Spark-clustering-notebook / coliseum
Project defining the Docker image that will support examples of algorithms created in this organization
☆13 Updated 7 years ago
Alternatives and similar repositories for coliseum
Users that are interested in coliseum are comparing it to the libraries listed below
- Some Spark implementations of clustering algorithms. ☆19 Updated 6 years ago
- MLeap allows for easily putting Spark ML pipelines into production ☆78 Updated 8 years ago
- Bayesian Networks in Scala ☆205 Updated 7 years ago
- ☆111 Updated 8 years ago
- A curated inventory of machine learning methods available on the Apache Spark platform, both in official and third party libraries. ☆65 Updated 8 years ago
- ☆21 Updated 9 years ago
- Data Science with Apache Spark and Spark Notebook ☆30 Updated 7 years ago
- Scala bindings for Bokeh plotting library ☆136 Updated last year
- Secondary sort and streaming reduce for Apache Spark ☆78 Updated last year
- Project, source code and data files for 1st edition "Scala for Machine Learning" ☆150 Updated 9 years ago
- Complete Pipeline Training at Big Data Scala By the Bay ☆71 Updated 9 years ago
- An example of running Apache Spark using Scala in an IPython notebook ☆140 Updated 9 years ago
- Functional, Typesafe, Declarative Data Pipelines ☆139 Updated 7 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python) ☆74 Updated last year
- Joins for skewed datasets in Spark ☆57 Updated 7 years ago
- Locality Sensitive Hashing for Apache Spark ☆195 Updated 8 years ago
- Additional useful algorithms that can be used with Spark. ☆24 Updated 10 years ago
- Spark Modularized View ☆42 Updated 4 years ago
- Scala: The Unpredicted Lingua Franca for Data Science ☆129 Updated 6 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python) ☆16 Updated 5 years ago
- Topic Modeling with LDA in Scala and Spark ☆31 Updated 6 years ago
- This package contains a generic implementation of greedy Information Theoretic Feature Selection (FS) methods. The implementation is base… ☆134 Updated 3 years ago
- Building Annoy Index on Apache Spark ☆72 Updated 4 years ago
- This project provides association rule mining for Apache Spark. The algorithms are based on the work of Philippe Fournier-Viger and comp… ☆31 Updated 10 years ago
- Code for Packt Publishing's Scala Data Analysis Cookbook. ☆49 Updated 9 years ago
- Spark 2.0 Scala Machine Learning examples ☆77 Updated 5 years ago
- An Apache Spark-shell backend for IPython ☆105 Updated 3 years ago
- Scala wrapper for Annoy ☆58 Updated 2 years ago
- C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering. ☆130 Updated 4 years ago
- An implementation of the Self-Tuning Spectral Clustering algorithm, and more. ☆12 Updated 8 years ago