klawson88 / LevenshteinAutomaton
A fast and comprehensive Java library capable of performing automaton and non-automaton based Levenshtein distance determination and neighbor calculations.
☆41Updated 11 years ago
Related projects: ⓘ
- Various utilities regarding Levenshtein transducers. (Java)☆56Updated 2 years ago
- Java text categorization system☆54Updated 7 years ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts☆80Updated 6 years ago
- Java autocomplete library.☆112Updated 4 years ago
- ☆42Updated this week
- extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format.☆124Updated 6 months ago
- NLP tools developed by Emory University.☆60Updated 8 years ago
- The Sweble Wikitext Components module provides a parser for MediaWiki's wikitext and an engine trying to emulate the behavior of a MediaW…☆70Updated 5 months ago
- NLP framework for JVM languages.☆148Updated 3 years ago
- An efficient and flexible token-based regular expression language and engine.☆74Updated 10 years ago
- A language detection library for the JVM☆35Updated last year
- JSuffixArrays (Suffix Arrays in Java)☆58Updated 7 years ago
- A Java library capable of constructing character-sequence-storing, directed acyclic graphs of minimal size☆43Updated 11 years ago
- Write parsers for arbitrary text inputs, entirely in Java, with no preprocessing phase☆63Updated 8 years ago
- Machine learning components for Apache UIMA☆129Updated last year
- Educational Examle of a custom Lucene Query & Scorer☆48Updated 4 years ago
- SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm☆18Updated 9 years ago
- WikiXMLJ provides easy access to Wikipedia XML dumps.☆21Updated 7 years ago
- Elasticsearch plugin for b-bit minhash algorism☆62Updated 3 months ago
- Lucene Auto Phrase TokenFilter implementation☆59Updated 6 years ago
- Java port of langid.py (language identifier)☆28Updated 11 years ago
- This is a Fact based Question Answering System using Apache Solr as backend search engine, Wikipedia dumps as information source, Apache …☆25Updated 2 years ago
- Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,...☆34Updated 5 years ago
- A bunch of fancy soft string matching routines, with some accompanying datasets☆54Updated 7 years ago
- BK-tree Java library☆28Updated 10 years ago
- ☆28Updated 2 weeks ago
- Facebook's FastText for Java☆78Updated 6 years ago
- Browser-driven explorer for lucene indexes☆72Updated 3 years ago
- BM25F demo with lucene using BlendedTermQuery and a custom similarity☆15Updated 7 years ago
- A Java package for the LDA and DMM topic models☆79Updated 5 years ago