lejon / PartiallyCollapsedLDA
Implementations of various fast parallelized samplers for LDA, including Partially Collapsed LDA, Light LDA, Partially Collapsed Light LDA and a very efficient Polya-Urn LDA
☆26Updated last year
Related projects: ⓘ
- Distributed implementation of Robust PLSA using Spark☆12Updated 3 years ago
- ☆22Updated this week
- A Java library for Stochastic Gradient Descent (SGD)☆19Updated 2 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆70Updated 4 years ago
- Topic Modeling with LDA in Scala and Spark☆31Updated 5 years ago
- A bunch of fancy soft string matching routines, with some accompanying datasets☆54Updated 7 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface.☆101Updated 6 years ago
- A Utility Library for Wikipedia dumps☆33Updated 7 years ago
- Open-Source Information Retrieval Reproducibility Challenge☆50Updated 8 years ago
- Global Vectors for Word Representation on spark☆35Updated 9 years ago
- Topic modeling with first-order logic (FOL) domain knowledge☆33Updated 12 years ago
- Example code to explore for using DL4J in Scala.☆19Updated 8 years ago
- NLP Utilities in Java☆43Updated last year
- Implicit relation extractor using a natural language model.☆25Updated 6 years ago
- Vowpal Wabbit Webservice. A web service that accepts VW formatted text and runs it through a VW daemon instance.☆40Updated 8 years ago
- Code and data for "Universal Approximation Functions for Fast Learning to Rank: Replacing Expensive Regression Forests with Simple Feed-F…☆9Updated 6 years ago
- An efficient and flexible token-based regular expression language and engine.☆74Updated 10 years ago
- Distributed Matrix Library☆70Updated 7 years ago
- NLP tools developed by Emory University.☆60Updated 8 years ago
- A RankLib based Solr Learning to Rank Plugin☆29Updated 2 years ago
- Fast and robust NLP components implemented in Java.☆52Updated 3 years ago
- MinorThird is a collection of Java classes for storing text, annotating text, and learning to extract entities and categorize text.☆55Updated 6 years ago
- Online Latent Dirichlet Allocation with Infinite Vocabulary using Variational Inference☆74Updated 8 years ago
- Machine Learning Library by Emmanouil Antonios Platanios☆28Updated 7 years ago
- Dynamic Topic Modeling and Topic Chains of Reuters News Articles using SCVB0☆23Updated 7 years ago
- Using Word2Vec on lists and sets☆34Updated 8 years ago
- NER tagger for English, Spanish, Dutch, Italian and German and French.☆35Updated 8 years ago
- Build tables of information by extracting facts from indexed text corpora via a simple and effective query language.☆56Updated 5 years ago
- A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and …☆48Updated 2 years ago
- Generalized Language Modeling toolkit☆52Updated 2 years ago