fujimotos / mblevenLinks
An efficient algorithm for k-bounded (Damerau-)Levenshtein distance
☆16Updated 6 years ago
Alternatives and similar repositories for mbleven
Users that are interested in mbleven are comparing it to the libraries listed below
Sorting:
- A C++ library implementing fast language models estimation using the 1-Sort algorithm.☆17Updated 2 years ago
- Deep learning spelling patterns with a recurrent neural network☆12Updated 8 years ago
- brat rapid annotation tool (brat) - for all your textual annotation needs☆10Updated 7 years ago
- A tool for detecting sentence fragments.☆7Updated 8 years ago
- Neural Elastic Inference and Search☆19Updated 5 years ago
- Anytime Ranking for Impact-Ordered Indexes☆13Updated 8 years ago
- Easy language identification of 380 languages☆17Updated 5 years ago
- allennlp + streamlit demo☆22Updated 5 years ago
- Transformer based Trigram Blocking implementation in Tensorflow☆11Updated 5 years ago
- Memory-efficient Count-Min Sketch Counter (based on Madoka C++ library)☆26Updated 6 years ago
- A DeepWalk implementation for ontologies using NetworkX and Gensim☆19Updated 8 years ago
- Query Segmentation for search☆20Updated 5 years ago
- Playing with arithmetic coding and RNNs☆22Updated 8 years ago
- Successor to Annoy https://github.com/spotify/annoy☆13Updated 9 years ago
- Supporting example for "A Rust SentencePiece implementation"☆18Updated 4 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms☆14Updated 2 years ago
- Official library of images for the SIGIR 2019 Open-Source IR Replicability Challenge (OSIRRC 2019)☆13Updated 5 years ago
- A tool for learning significant phrase/term models, and efficiently labeling with them.☆33Updated last month
- Short Text Similarity as described in https://dl.acm.org/citation.cfm?id=2806475☆16Updated 6 years ago
- ☆28Updated 5 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning☆41Updated 5 years ago
- Generic Environment for Context-Aware Correction of Orthography☆22Updated 2 years ago
- Source code for my paper "Matrix Differential Calculus with Tensors (for Machine Learning)"☆12Updated 8 years ago
- A python library to generate highly realistic typos (fuzz-testing)☆11Updated 2 months ago
- Highly specialized crate to parse and use `google/sentencepiece` 's precompiled_charsmap in `tokenizers`☆18Updated 3 years ago
- Yet another regression toolkit☆12Updated 11 years ago
- Inspired by the neural style algorithm in the computer vision field, we propose a high-level language model with the aim of adapting the …☆19Updated 2 years ago
- Simplifying parsing of large jsonline files in NLP Workflows☆12Updated 3 years ago
- Hidden alignment conditional random field for classifying string pairs.☆24Updated 3 weeks ago