utcompling / OpenNLP-Models
A project for code to create models from existing corpora and distribute models.
☆42Updated 12 years ago
Alternatives and similar repositories for OpenNLP-Models:
Users that are interested in OpenNLP-Models are comparing it to the libraries listed below
- Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.☆158Updated 2 years ago
- Website for standardized execution and evaluation of algorithms on datasets.☆36Updated 5 years ago
- Python wrapper for the Vowpal Wabbit machine learning library.☆53Updated 11 years ago
- NLP Utilities in Java☆43Updated 2 years ago
- Scala utilities for teaching computational linguistics and prototyping algorithms.☆42Updated 12 years ago
- Scala port of the word2vec toolkit.☆11Updated 8 years ago
- distributed latent dirichlet allocation☆30Updated 13 years ago
- Extract statistics from Wikipedia Dump files.☆26Updated 3 years ago
- Document clustering based on Latent Semantic Analysis☆96Updated 14 years ago
- Example code to explore for using DL4J in Scala.☆19Updated 9 years ago
- Entity Linking for the masses☆56Updated 9 years ago
- Mahout vector encoding for pig☆54Updated 2 years ago
- An Apache Lucene TokenFilter that uses a word2vec vectors for term expansion.☆24Updated 11 years ago
- Python functions for popular relevance metrics (ndcg, err, etc)☆16Updated last year
- A Hadoop toolkit for web-scale information retrieval research☆82Updated 10 years ago
- Tool for tweaking dbpedia spotlight's models☆16Updated 7 years ago
- An analysis of adverse drug event data using Hadoop, R, and Gephi☆44Updated 9 years ago
- xlvector's solution of github contest☆33Updated 15 years ago
- Movie recommendations and more in MapReduce and Scalding☆117Updated 12 years ago
- Simple FieldCache based query introspection Solr Search Component - solves the 'red sofa' problem☆12Updated 2 months ago
- CrowdRec reference framework☆32Updated 8 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- A collection of documents and materials for the EMNLP-2015 Semantic Similarity tutorial☆30Updated 9 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆51Updated 7 years ago
- Library for building reproducible data pipelines to support experimentation☆20Updated 9 years ago
- Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles.☆38Updated 6 years ago
- (Weighted) Finite State Transducers for Scala NLP☆21Updated 10 years ago
- Fast and robust NLP components implemented in Java.☆52Updated 4 years ago
- Machine learning and natural language processing with Apache Pig☆53Updated 11 years ago
- Machine Learning for Cascading☆82Updated 9 years ago