Spark-clustering-notebook / coliseum
Project defining the docker image that will support examples of algorithms created in this organization
☆13Updated 7 years ago
Alternatives and similar repositories for coliseum:
Users that are interested in coliseum are comparing it to the libraries listed below
- Spark Modularized View☆42Updated 4 years ago
- Some Spark implementations of clustering algorithms.☆19Updated 6 years ago
- ☆21Updated 8 years ago
- Reasonable API for serving TensorFlow models using Scala☆31Updated 7 years ago
- An example of running Apache Spark using Scala in ipython notebook☆140Updated 9 years ago
- Complete Pipeline Training at Big Data Scala By the Bay☆71Updated 9 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year
- Data Science with Apache Spark and Spark Notebook☆30Updated 7 years ago
- MLeap allows for easily putting Spark ML pipelines into production☆78Updated 8 years ago
- Spark 2.0 Scala Machine Learning examples☆77Updated 5 years ago
- A curated inventory of machine learning methods available on the Apache Spark platform, both in official and third party libraries.☆65Updated 7 years ago
- ☆111Updated 7 years ago
- Bayesian Networks in Scala☆205Updated 7 years ago
- Scala Library/REPL for Machine Learning Research☆201Updated last year
- Scala: The Unpredicted Lingua Franca for Data Science☆129Updated 6 years ago
- Topic Modeling with LDA in Scala and Spark☆31Updated 6 years ago
- An Apache Spark-shell backend for IPython☆105Updated 3 years ago
- something to help you spark☆65Updated 6 years ago
- Joins for skewed datasets in Spark☆57Updated 7 years ago
- A type class for data of all sizes.☆15Updated 5 years ago
- sbt plugin for spark-submit☆96Updated 7 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆16Updated 5 years ago
- Secondary sort and streaming reduce for Apache Spark☆78Updated last year
- Scala bindings for Bokeh plotting library☆136Updated last year
- Project, source code and data files for 1st edition "Scala for Machine Learning"☆150Updated 9 years ago
- Scripts for parsing / making sense of yarn logs☆52Updated 8 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆93Updated 9 years ago
- Scalable query engine for web scrapping/data mashup/acceptance QA, powered by Apache Spark☆142Updated last week
- Bucketing and partitioning system for Parquet☆30Updated 6 years ago
- Additional useful algorithms that can be used with spark.☆24Updated 10 years ago