troshko111 / fast-fuzzy-matchingLinks
BK-tree with Damerau-Levenshtein distance and Trie with Levenshtein distance
☆19Updated 7 years ago
Alternatives and similar repositories for fast-fuzzy-matching
Users that are interested in fast-fuzzy-matching are comparing it to the libraries listed below
Sorting:
- CRFSharp is Conditional Random Fields implemented by .NET(C#), a machine learning algorithm for learning from labeled sequences of exampl…☆121Updated 4 years ago
- Inverted file indexing and retrieval optimized for short texts. Supports auto-suggest and query segment classification.☆34Updated 2 years ago
- Implementation of Aho-Corasick string matching algorithm for .NET☆30Updated 9 years ago
- Fast approximate strings search & spelling correction☆58Updated 3 years ago
- ReactGraph is a library to make change propagation easy in .NET.☆63Updated 10 years ago
- Educational Examle of a custom Lucene Query & Scorer☆48Updated 5 years ago
- Fast Word Segmentation with Triangular Matrix☆81Updated 3 years ago
- An implementation of locality sensitive hashing with Hadoop☆57Updated 10 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Updated 8 years ago
- Exploration Library in Java☆12Updated 2 years ago
- Txt2Vec is a toolkit to represent text by vector. It's based on Google's word2vec project, but with some new features, such incremental t…☆68Updated 9 years ago
- Advanced Utility Libs☆24Updated 5 years ago
- A fast and comprehensive Java library capable of performing automaton and non-automaton based Levenshtein distance determination and neig…☆42Updated 12 years ago
- Fast and robust NLP components implemented in Java.☆52Updated 4 years ago
- CRF is a Java implementation of Conditional Random Fields, an algorithm for learning from labeled sequences of examples. It also includes…☆28Updated 10 years ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts☆81Updated 6 years ago
- OCRonet is optical character recognition (OCR) and document analysis system based on Convolutional Neural Networks (LeNet-5) and OCRopus.☆21Updated 6 years ago
- .NET client for Google Bigtable☆19Updated 9 years ago
- Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,...☆34Updated 6 years ago
- Exploration Library in C#☆16Updated last year
- An utility to randomize and split really huge (100+ GB) text files☆21Updated 8 years ago
- Vowpal Wabbit Webservice. A web service that accepts VW formatted text and runs it through a VW daemon instance.☆40Updated 9 years ago
- C# wrapper for the PredictionIO API☆27Updated 9 years ago
- Implementing Like2Vec (Word2Vec for users or items) using TensorFlow☆19Updated 9 years ago
- WARC (Web Archive) Input and Output Formats for Hadoop☆35Updated 10 years ago
- Java implementation of the TextRank algorithm by Mihalcea, et al. http://lit.csci.unt.edu/index.php/Graph-based_NLP☆29Updated 4 years ago
- ☆32Updated 9 years ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts☆59Updated 12 years ago
- NLP tools developed by Emory University.☆60Updated 8 years ago
- Search engine library☆30Updated 10 years ago