aria42 / nlp-utilsLinks
NLP Utilities in Java
☆43Updated 3 years ago
Alternatives and similar repositories for nlp-utils
Users that are interested in nlp-utils are comparing it to the libraries listed below
Sorting:
- Scalable Distributed LDA implementation for Spark & Glint☆29Updated 9 years ago
- Another, hopefully better, implementation of ALS on Spark☆14Updated 10 years ago
- Splash Project for parallel stochastic learning☆93Updated 8 years ago
- Regularized latent variable mixed membership modeling☆13Updated 12 years ago
- NLP toolkit (tokenizer, POS-tagger, parser, etc.)☆43Updated 8 years ago
- An efficient and flexible token-based regular expression language and engine.☆75Updated 11 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.☆30Updated 10 years ago
- Distributed Matrix Library☆72Updated 8 years ago
- Distributed implementation of Robust PLSA using Spark☆12Updated 4 years ago
- ReactiveLDA is a fast, lightweight implementation of the Latent Dirichlet Allocation (LDA) algorithm, using a parallel vanilla Gibbs samp…☆61Updated 10 years ago
- This toolkit provides an implementation of Modified Adsorption (MAD), a graph-based semi-supervised learning (SSL) algorithm.☆24Updated 8 years ago
- Spark MLlib code optimized to efficiently support sparse data☆51Updated 9 years ago
- Topic Modeling on Apache Spark☆94Updated 6 years ago
- A collection of documents and materials for the EMNLP-2015 Semantic Similarity tutorial☆30Updated 10 years ago
- Quick summary: This code implements a spectral (third order tensor decomposition) learning method for learning LDA topic model on Spark.☆104Updated 7 years ago
- NLP tools developed by Emory University.☆61Updated 9 years ago
- I re-implemented a semi-supervised recursive autoencoder in java. I think it is a pretty nice technique. Check it out! Or fork it☆72Updated 8 years ago
- A CPU and GPU-accelerated matrix library for data mining☆267Updated 4 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"☆21Updated 10 years ago
- An Apache Lucene TokenFilter that uses a word2vec vectors for term expansion.☆24Updated 11 years ago
- ☆14Updated 9 years ago
- Cython implementation of DeepWalk☆53Updated 2 years ago
- Puck is a lightning-fast parser for natural languages using GPUs☆249Updated 11 years ago
- Locality Sensitive Hashing for Apache Spark☆87Updated 3 years ago
- cuda implementation of CBOW model (word2vec)☆117Updated 12 years ago
- xlvector's solution of github contest☆33Updated 16 years ago
- TuffyLite is an open-source MLN inference engine that modifies the original Tuffy solver.☆27Updated 9 years ago
- Yahoo!'s topic modelling framework using Latent Dirichlet Allocation☆98Updated 14 years ago
- Spark library for doing exploratory data analysis in a scalable way☆43Updated 9 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Updated 9 years ago