larsmans / lucene-stanford-lemmatizerLinks
A library that adds some NLP capabilities to the Lucene search engine
☆50Updated 11 years ago
Alternatives and similar repositories for lucene-stanford-lemmatizer
Users that are interested in lucene-stanford-lemmatizer are comparing it to the libraries listed below
Sorting:
- Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.☆158Updated 2 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface.☆101Updated 7 years ago
- A Hadoop toolkit for web-scale information retrieval research☆83Updated 10 years ago
- Examples of use of pig scripting languages capabilities☆39Updated 8 years ago
- The S-Space repsitory, from the AIrhead-Research group☆205Updated 4 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- simple simhashing in hadoop with cascading☆33Updated 14 years ago
- NLP tools developed by Emory University.☆60Updated 8 years ago
- Solr for Astrophysics Data System☆53Updated last month
- A Query Autofiltering SearchComponent for Solr that can translate free-text queries into structured queries using index metadata☆28Updated 6 years ago
- Using deep learning to POS tag sentences using scala + DL4J☆37Updated 10 years ago
- Movie recommendations and more in MapReduce and Scalding☆117Updated 12 years ago
- Bulk loading for elastic search☆184Updated last year
- Mahout vector encoding for pig☆54Updated 2 years ago
- ElasticSearch Prediction Generator and Plugin☆22Updated 9 years ago
- A project for code to create models from existing corpora and distribute models.☆42Updated 13 years ago
- NLP Utilities in Java☆43Updated 2 years ago
- Python wrapper for the Vowpal Wabbit machine learning library.☆53Updated 11 years ago
- Search a single field with different query time analyzers in Solr☆25Updated 5 years ago
- iSAX Indexing persisted in HBase☆39Updated 13 years ago
- Latent Dirichlet Allocation for topic modeling of streamed data sources☆100Updated 10 years ago
- A very memory-efficient trie (radix tree) implementation☆47Updated 12 years ago
- distributed latent dirichlet allocation☆30Updated 13 years ago
- Scala utilities for teaching computational linguistics and prototyping algorithms.☆42Updated 12 years ago
- (Weighted) Finite State Transducers for Scala NLP☆21Updated 10 years ago
- Machine learning and natural language processing with Apache Pig☆53Updated 11 years ago
- Large RDF hierarchies as vector spaces☆20Updated 10 years ago
- (deprecated) Please use new nlp4l instead.☆66Updated 8 years ago
- Elasticsearch Index Termlist☆117Updated 6 years ago
- Course repository for Applied Natural Language Processing☆126Updated 12 years ago