searchhub / preDict
Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts
☆80Updated 6 years ago
Related projects: ⓘ
- Java port of SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm☆63Updated 3 years ago
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear…☆84Updated 3 years ago
- Search relevance evaluation toolkit☆73Updated 2 years ago
- Dice.com tutorial on using black box optimization algorithms to do relevancy tuning on your Solr Search Engine Configuration from Simon H…☆28Updated 5 years ago
- Fast and robust NLP components implemented in Java.☆52Updated 3 years ago
- NLP tools developed by Emory University.☆60Updated 8 years ago
- Querqy for Elasticsearch☆45Updated 3 weeks ago
- Fast approximate strings search & spelling correction☆57Updated 2 years ago
- Search a single field with different query time analyzers in Solr☆25Updated 4 years ago
- Search relevance evaluation toolkit☆30Updated last year
- Fast supervised sentence boundary detection using the averaged perceptron☆90Updated 5 years ago
- NLP framework for JVM languages.☆148Updated 3 years ago
- Vector search in Lucene based search attempting to use just the existing Lucene data structures (experimental)☆43Updated 4 years ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆49Updated 4 years ago
- Solr Query Segmenter for structuring unstructured queries☆21Updated 3 years ago
- Vector Plugin for Solr: calculate dot product / cosine similarity on documents☆14Updated 5 years ago
- Named Entity Recognition data for Europeana Newspapers☆171Updated last year
- A Utility Library for Wikipedia dumps☆33Updated 7 years ago
- Hardened Fork of Ranklib learning to rank library☆43Updated last year
- Dice.com's relevancy feedback solr plugin created by Simon Hughes (Dice). Contains request handlers for doing MLT style recommendations, …☆23Updated 3 years ago
- SymSpellCompound: compound aware automatic spelling correction☆66Updated 6 years ago
- A fast and comprehensive Java library capable of performing automaton and non-automaton based Levenshtein distance determination and neig…☆41Updated 11 years ago
- Vector Plugin for Solr: calculate dot product / cosine similarity on documents☆34Updated 3 years ago
- ☆16Updated 3 years ago
- Fast Word Clustering Software☆74Updated last month
- An Apache Lucene TokenFilter that uses a word2vec vectors for term expansion.☆24Updated 10 years ago
- Open-source tools for morphological tagging, segmentation and stemming.☆41Updated 5 years ago
- An open relation extraction system☆46Updated 2 years ago
- NER tagger for English, Spanish, Dutch, Italian and German and French.☆35Updated 8 years ago
- Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipg…☆123Updated 10 months ago