lejon / PartiallyCollapsedLDALinks
Implementations of various fast parallelized samplers for LDA, including Partially Collapsed LDA, Light LDA, Partially Collapsed Light LDA and a very efficient Polya-Urn LDA
☆28Updated 2 years ago
Alternatives and similar repositories for PartiallyCollapsedLDA
Users that are interested in PartiallyCollapsedLDA are comparing it to the libraries listed below
Sorting:
- Online Latent Dirichlet Allocation with Infinite Vocabulary using Variational Inference☆74Updated 10 years ago
- A bunch of fancy soft string matching routines, with some accompanying datasets☆56Updated 8 years ago
- Distributed implementation of Robust PLSA using Spark☆12Updated 4 years ago
- A Utility Library for Wikipedia dumps☆33Updated 8 years ago
- NLP tools developed by Emory University.☆61Updated 9 years ago
- Automatically exported from code.google.com/p/jforests☆67Updated 5 years ago
- Open-Source Information Retrieval Reproducibility Challenge☆50Updated 9 years ago
- Interactive machine learning for text analysis☆85Updated 9 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- Automatically exported from code.google.com/p/sofia-ml☆61Updated 5 years ago
- The S-Space repsitory, from the AIrhead-Research group☆204Updated 5 years ago
- A Java library for Stochastic Gradient Descent (SGD)☆22Updated 4 years ago
- Global Vectors for Word Representation on spark☆35Updated 11 years ago
- Machine Learning Tool Kit☆138Updated 5 years ago
- Elasticsearch Latent Semantic Indexing experimentation☆33Updated 6 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface.☆101Updated 8 years ago
- Fast Word Clustering Software☆79Updated 10 months ago
- Official repository of Quickscorer: a fast algorithm to rank documents with additive ensembles of regression trees.☆18Updated 9 years ago
- An efficient and flexible token-based regular expression language and engine.☆75Updated 11 years ago
- Theano implementation of GloVe for graphs☆47Updated 10 years ago
- Scripts and codes for replicating experiments published in Exploring Topic Coherence over many models and many topics☆83Updated 3 years ago
- Generalized Language Modeling toolkit☆51Updated 3 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆92Updated 9 years ago
- Python port of Mikolov's word2phrase.c from the word2vec toolkit☆111Updated 5 years ago
- A Python framework for exploring distributional semantic models.☆85Updated 10 years ago
- Semantic embeddings of entities☆66Updated 9 years ago
- Tool for tweaking dbpedia spotlight's models☆16Updated 8 years ago
- Splash Project for parallel stochastic learning☆93Updated 8 years ago
- Implicit relation extractor using a natural language model.☆24Updated 7 years ago
- Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neig…☆98Updated 10 years ago