troshko111 / fast-fuzzy-matching
BK-tree with Damerau-Levenshtein distance and Trie with Levenshtein distance
☆19Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for fast-fuzzy-matching
- NLP tools developed by Emory University.☆60Updated 8 years ago
- Txt2Vec is a toolkit to represent text by vector. It's based on Google's word2vec project, but with some new features, such incremental t…☆68Updated 8 years ago
- ReactGraph is a library to make change propagation easy in .NET.☆63Updated 9 years ago
- CRFSharp is Conditional Random Fields implemented by .NET(C#), a machine learning algorithm for learning from labeled sequences of exampl…☆122Updated 4 years ago
- My implementation of Explicit Semantic Analysis (ESA) library that we used at KMi, Open University to produce our submission at the NTCIR…☆36Updated 9 years ago
- A C# code generator using Roslyn, extracted from Wasabi v3.1.0. MIT licensed.☆46Updated 9 years ago
- Fast approximate strings search & spelling correction☆57Updated 3 years ago
- Implementation of Aho-Corasick string matching algorithm for .NET☆29Updated 8 years ago
- OCRonet is optical character recognition (OCR) and document analysis system based on Convolutional Neural Networks (LeNet-5) and OCRopus.☆21Updated 5 years ago
- A large-scale statistical machine translation system written in Java.☆208Updated 2 years ago
- ☆32Updated 9 years ago
- Python code implementing the MWUA algorithm and a Linear Program solver☆34Updated last year
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Updated 8 years ago
- Partial Java port of the C++ OpenFST library☆36Updated 2 years ago
- TREC Real-Time Summarization Tools☆15Updated 7 years ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts☆80Updated 6 years ago
- Fast and robust NLP components implemented in Java.☆52Updated 4 years ago
- An implementation of locality sensitive hashing with Hadoop☆57Updated 9 years ago
- Java text categorization system☆54Updated 7 years ago
- Educational Examle of a custom Lucene Query & Scorer☆48Updated 4 years ago
- Base components for Question Answering pipelines☆28Updated 2 years ago
- A fast and comprehensive Java library capable of performing automaton and non-automaton based Levenshtein distance determination and neig…☆41Updated 11 years ago
- Web Data Extraction from Flat and Nested Records☆9Updated 8 years ago
- An efficient and flexible token-based regular expression language and engine.☆75Updated 10 years ago
- ScalableJoins☆16Updated 9 years ago
- NLP toolkit (tokenizer, POS-tagger, parser, etc.)☆42Updated 7 years ago