fujimotos / mblevenLinks
An efficient algorithm for k-bounded (Damerau-)Levenshtein distance
☆15Updated 6 years ago
Alternatives and similar repositories for mbleven
Users that are interested in mbleven are comparing it to the libraries listed below
Sorting:
- brat rapid annotation tool (brat) - for all your textual annotation needs☆10Updated 7 years ago
- A workflow system for Natural Language Processing.☆21Updated 5 years ago
- Memory-efficient Count-Min Sketch Counter (based on Madoka C++ library)☆26Updated 6 years ago
- allennlp + streamlit demo☆22Updated 5 years ago
- Successor to Annoy https://github.com/spotify/annoy☆13Updated 9 years ago
- Deep learning spelling patterns with a recurrent neural network☆12Updated 8 years ago
- A C++ library implementing fast language models estimation using the 1-Sort algorithm.☆17Updated 2 years ago
- Code and data from the paper "Email formality in the workplace: A case study on the Enron corpus"☆10Updated 9 years ago
- Tokenize and clean strings in Python☆12Updated 7 years ago
- Converter from UD-trees to BART representation☆36Updated last year
- GSDMM: Short text clustering (Rust implementation)☆22Updated 2 years ago
- A tool for detecting sentence fragments.☆7Updated 8 years ago
- ☆30Updated 3 years ago
- Short Text Similarity as described in https://dl.acm.org/citation.cfm?id=2806475☆16Updated 6 years ago
- Yet another regression toolkit☆12Updated 11 years ago
- 🔮 spaCy's Machine Learning library for NLP in Python☆8Updated 6 years ago
- Neural Elastic Inference and Search☆19Updated 5 years ago
- LEMON: Explainable Entity Matching☆18Updated 3 years ago
- Easy language identification of 380 languages☆17Updated 5 years ago
- Official repository of Quickscorer: a fast algorithm to rank documents with additive ensembles of regression trees.☆18Updated 8 years ago
- A DeepWalk implementation for ontologies using NetworkX and Gensim☆19Updated 8 years ago
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow☆13Updated 8 years ago
- A python library to generate highly realistic typos (fuzz-testing)☆11Updated 3 months ago
- Code and data for "Universal Approximation Functions for Fast Learning to Rank: Replacing Expensive Regression Forests with Simple Feed-F…☆9Updated 6 years ago
- Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms☆14Updated 2 years ago
- Official library of images for the SIGIR 2019 Open-Source IR Replicability Challenge (OSIRRC 2019)☆13Updated 5 years ago
- Anytime Ranking for Impact-Ordered Indexes☆13Updated 8 years ago
- An open-source NLP library: fast text cleaning and preprocessing☆23Updated 3 years ago
- An efficient data structure for fast string similarity searches☆22Updated 4 years ago
- Supporting example for "A Rust SentencePiece implementation"☆18Updated 5 years ago