larsmans / lucene-stanford-lemmatizerLinks
A library that adds some NLP capabilities to the Lucene search engine
☆50Updated 11 years ago
Alternatives and similar repositories for lucene-stanford-lemmatizer
Users that are interested in lucene-stanford-lemmatizer are comparing it to the libraries listed below
Sorting:
- Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.☆158Updated 2 years ago
- simple simhashing in hadoop with cascading☆33Updated 14 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- A Query Autofiltering SearchComponent for Solr that can translate free-text queries into structured queries using index metadata☆28Updated 6 years ago
- NLP tools developed by Emory University.☆60Updated 8 years ago
- Educational Examle of a custom Lucene Query & Scorer☆48Updated 5 years ago
- Search a single field with different query time analyzers in Solr☆25Updated 5 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface.☆101Updated 7 years ago
- A RankLib based Solr Learning to Rank Plugin☆29Updated 2 years ago
- A Hadoop toolkit for web-scale information retrieval research☆84Updated 10 years ago
- Implementation of Tyler Neylon's Locality-Specific Hash based on simplex tesselations☆28Updated 13 years ago
- Dice.com tutorial on using black box optimization algorithms to do relevancy tuning on your Solr Search Engine Configuration from Simon H…☆28Updated 6 years ago
- Elasticsearch Latent Semantic Indexing experimentation☆33Updated 5 years ago
- Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles.☆38Updated 6 years ago
- Movie recommendations and more in MapReduce and Scalding☆117Updated 12 years ago
- Machine learning and natural language processing with Apache Pig☆53Updated 11 years ago
- Large RDF hierarchies as vector spaces☆20Updated 11 years ago
- Dice Solr Plugins from Simon Hughes Dice.com☆87Updated 4 years ago
- The S-Space repsitory, from the AIrhead-Research group☆205Updated 4 years ago
- Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.☆282Updated 7 years ago
- distributed latent dirichlet allocation☆30Updated 13 years ago
- My implementation of Explicit Semantic Analysis (ESA) library that we used at KMi, Open University to produce our submission at the NTCIR…☆36Updated 9 years ago
- Stream-based InputFormat for processing the compressed XML dumps of Wikipedia with Hadoop☆85Updated 12 years ago
- A little text processing library for Scala.☆28Updated 9 years ago
- Java text categorization system☆56Updated 8 years ago
- Scala utilities for teaching computational linguistics and prototyping algorithms.☆42Updated 12 years ago
- SolrCloud Rebalance API Documentation☆13Updated 8 years ago
- Bulk loading for elastic search☆185Updated last year
- ElasticSearch Prediction Generator and Plugin☆22Updated 9 years ago
- Examples of Solr configuration entries for Solr plugins and Conceptual Search\Semantic Search from Simon Hughes Dice.com☆26Updated 8 years ago