utcompling / OpenNLP-Models
A project for code to create models from existing corpora and distribute models.
☆42Updated 13 years ago
Alternatives and similar repositories for OpenNLP-Models:
Users that are interested in OpenNLP-Models are comparing it to the libraries listed below
- NLP Utilities in Java☆43Updated 2 years ago
- Python wrapper for the Vowpal Wabbit machine learning library.☆53Updated 11 years ago
- Scala utilities for teaching computational linguistics and prototyping algorithms.☆42Updated 12 years ago
- Document clustering based on Latent Semantic Analysis☆95Updated 15 years ago
- simple simhashing in hadoop with cascading☆33Updated 14 years ago
- Website for standardized execution and evaluation of algorithms on datasets.☆36Updated 5 years ago
- distributed latent dirichlet allocation☆30Updated 13 years ago
- Code to allow running BIDMach on Spark including HDFS integration and lightweight sparse model updates (Kylix).☆15Updated 4 years ago
- Example code to explore for using DL4J in Scala.☆19Updated 9 years ago
- Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.☆158Updated 2 years ago
- An implementation of gibbs sampling for Latent Dirichlet Allocation☆30Updated 13 years ago
- Extract statistics from Wikipedia Dump files.☆26Updated 3 years ago
- Machine learning and natural language processing with Apache Pig☆53Updated 11 years ago
- Mahout vector encoding for pig☆54Updated 2 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- Semanticizest: dump parser and client☆20Updated 8 years ago
- (deprecated) Please use new nlp4l instead.☆66Updated 8 years ago
- Fast and robust NLP components implemented in Java.☆52Updated 4 years ago
- An example project for doing grid search in MLlib☆13Updated 10 years ago
- Scala port of the word2vec toolkit.☆11Updated 8 years ago
- Machine Learning for Cascading☆81Updated 9 years ago
- Easily identify and label sentence intervals using various taggers.☆16Updated 8 years ago
- An Apache Lucene TokenFilter that uses a word2vec vectors for term expansion.☆24Updated 11 years ago
- (Weighted) Finite State Transducers for Scala NLP☆21Updated 10 years ago
- xlvector's solution of github contest☆33Updated 15 years ago
- Library for building reproducible data pipelines to support experimentation☆20Updated 9 years ago
- Machine Learning Open Source Software☆23Updated 6 years ago
- Movie recommendations and more in MapReduce and Scalding☆118Updated 12 years ago
- NYAN is a news filtering engine written in Python and some Ruby.☆15Updated last year
- A toolkit that wraps various natural language processing implementations behind a common interface.☆101Updated 7 years ago