troshko111 / fast-fuzzy-matching
BK-tree with Damerau-Levenshtein distance and Trie with Levenshtein distance
☆19Updated 7 years ago
Related projects: ⓘ
- ☆20Updated this week
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Updated 7 years ago
- A fast and comprehensive Java library capable of performing automaton and non-automaton based Levenshtein distance determination and neig…☆41Updated 11 years ago
- CRFSharp is Conditional Random Fields implemented by .NET(C#), a machine learning algorithm for learning from labeled sequences of exampl…☆122Updated 4 years ago
- NLP tools developed by Emory University.☆60Updated 8 years ago
- Fast approximate strings search & spelling correction☆57Updated 2 years ago
- A C# code generator using Roslyn, extracted from Wasabi v3.1.0. MIT licensed.☆46Updated 9 years ago
- Exploration Library in C#☆15Updated 7 months ago
- Suite of parallel iterative algorithms built on top of Iterative Reduce☆106Updated 10 years ago
- Context sensitive spell checker for Icelandic based on a recurrent neural network model from karpathy/char-rnn. This repo is no longer in…☆40Updated 8 years ago
- Java implementation of the TextRank algorithm by Mihalcea, et al. http://lit.csci.unt.edu/index.php/Graph-based_NLP☆29Updated 3 years ago
- Fast Word Segmentation with Triangular Matrix☆77Updated 2 years ago
- ☆73Updated this week
- Advanced Utility Libs☆23Updated 4 years ago
- Implementation of Aho-Corasick string matching algorithm for .NET☆29Updated 8 years ago
- OCRonet is optical character recognition (OCR) and document analysis system based on Convolutional Neural Networks (LeNet-5) and OCRopus.☆21Updated 5 years ago
- ☆32Updated 8 years ago
- Java text categorization system☆54Updated 7 years ago
- Generalized Language Modeling toolkit☆52Updated 2 years ago
- Exploration Library in Java☆12Updated last year
- Partial Java port of the C++ OpenFST library☆36Updated 2 years ago
- Educational Examle of a custom Lucene Query & Scorer☆48Updated 4 years ago
- Web Data Extraction from Flat and Nested Records☆9Updated 8 years ago
- SymSpellCompound: compound aware automatic spelling correction☆66Updated 6 years ago
- ScalableJoins☆16Updated 8 years ago
- An efficient and flexible token-based regular expression language and engine.☆74Updated 10 years ago
- Earth Mover's Distance in Java☆46Updated 12 years ago
- Implicit relation extractor using a natural language model.☆25Updated 6 years ago