lejon / PartiallyCollapsedLDA
Implementations of various fast parallelized samplers for LDA, including Partially Collapsed LDA, Light LDA, Partially Collapsed Light LDA and a very efficient Polya-Urn LDA
☆28 Updated 2 years ago
Alternatives and similar repositories for PartiallyCollapsedLDA
Users interested in PartiallyCollapsedLDA are comparing it to the libraries listed below
- Online Latent Dirichlet Allocation with Infinite Vocabulary using Variational Inference ☆74 Updated 10 years ago
- Open-Source Information Retrieval Reproducibility Challenge ☆50 Updated 9 years ago
- Global Vectors for Word Representation on spark ☆35 Updated 11 years ago
- A Java library for Stochastic Gradient Descent (SGD) ☆22 Updated 4 years ago
- Scripts and codes for replicating experiments published in Exploring Topic Coherence over many models and many topics ☆82 Updated 3 years ago
- Automatically exported from code.google.com/p/jforests ☆67 Updated 5 years ago
- A Java package for the LDA and DMM topic models ☆83 Updated 6 years ago
- Distributed implementation of Robust PLSA using Spark ☆12 Updated 4 years ago
- Open-source implementation of the BilBOWA (Bilingual Bag-of-Words without Alignments) word embedding model. ☆69 Updated 4 years ago
- Interactive machine learning for text analysis ☆85 Updated 9 years ago
- The S-Space repository, from the AIrhead-Research group ☆204 Updated 5 years ago
- NLP Utilities in Java ☆43 Updated 2 years ago
- Yara K-Beam Arc-Eager Dependency Parser ☆56 Updated 9 years ago
- Interactive book on Statistical NLP ☆32 Updated 8 years ago
- An efficient and flexible token-based regular expression language and engine. ☆75 Updated 11 years ago
- Official repository of Quickscorer: a fast algorithm to rank documents with additive ensembles of regression trees. ☆18 Updated 9 years ago
- Building Annoy Index on Apache Spark ☆72 Updated 4 years ago
- Topic Modeling on Apache Spark ☆94 Updated 6 years ago
- Machine Learning Tool Kit ☆138 Updated 5 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets ☆92 Updated 9 years ago
- A Utility Library for Wikipedia dumps ☆33 Updated 8 years ago
- Distributed Matrix Library ☆72 Updated 8 years ago
- Wikipedia-based Explicit Semantic Analysis, as described by Gabrilovich and Markovitch ☆36 Updated 5 years ago
- Scalable Distributed LDA implementation for Spark & Glint ☆29 Updated 9 years ago
- Fast Word Clustering Software ☆79 Updated 9 months ago
- Framework for doing NER and other types of entity recognition, in Python ☆68 Updated 3 years ago
- Pacaya - A Library for Hybrid Graphical Models and Neural Networks ☆45 Updated 8 years ago
- Elasticsearch Latent Semantic Indexing experimentation ☆33 Updated 6 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers. ☆30 Updated 10 years ago
- Splash Project for parallel stochastic learning ☆94 Updated 8 years ago