s-ramaswamy / CME-323-project
Implementation of Minimum Spanning Trees on Apache Spark.
☆10Updated 9 years ago
Alternatives and similar repositories for CME-323-project:
Users that are interested in CME-323-project are comparing it to the libraries listed below
- This toolkit provides an implementation of Modified Adsorption (MAD), a graph-based semi-supervised learning (SSL) algorithm.☆23Updated 7 years ago
- ☆21Updated 9 years ago
- Tensor-based Spectral LDA on Spark☆18Updated 6 years ago
- Vector-free L-BFGS implementation on Spark☆9Updated 8 years ago
- Gaussian Mixture Model Implementation in Pyspark☆32Updated 10 years ago
- A primal-dual framework for distributed L1-regularized optimization☆35Updated 8 years ago
- Python and Scala APIs for enhanced Spark analytics☆12Updated 7 years ago
- Clustering documents based on LSH☆14Updated 8 years ago
- ReactiveLDA is a fast, lightweight implementation of the Latent Dirichlet Allocation (LDA) algorithm, using a parallel vanilla Gibbs samp…☆61Updated 9 years ago
- Design algorithms for cross document coreference resolution☆17Updated 11 years ago
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark☆31Updated 6 years ago
- NLP Utilities in Java☆43Updated 2 years ago
- Probabilistic Itemset Mining☆19Updated 8 years ago
- Scala port of the word2vec toolkit.☆11Updated 8 years ago
- Distributed solver library for large-scale structured output prediction, based on Spark. Project website:☆17Updated 8 years ago
- Matrix factorization using TensorFlow☆63Updated 5 years ago
- Data Science in Scala - Conf. Talk Repo☆15Updated 8 years ago
- Project for the talk on NLP using LSTM implementation from DL4J on Spark☆20Updated 8 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Updated 8 years ago
- Tweet Analysis with Spark☆15Updated 7 years ago
- Vowpal Wabbit Webservice. A web service that accepts VW formatted text and runs it through a VW daemon instance.☆40Updated 8 years ago
- Topic Modeling on Apache Spark☆94Updated 5 years ago
- A curated inventory of machine learning methods available on the Apache Spark platform, both in official and third party libraries.☆65Updated 7 years ago
- A collection of documents and materials for the EMNLP-2015 Semantic Similarity tutorial☆30Updated 9 years ago
- Another, hopefully better, implementation of ALS on Spark☆14Updated 9 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"☆21Updated 9 years ago
- ☆36Updated 11 years ago
- Reimplementation of deepwalk algorithm from https://github.com/phanein/deepwalk☆38Updated 9 years ago