fujimotos / mbleven
An efficient algorithm for k-bounded (Damerau-)Levenshtein distance
☆16 · Updated 6 years ago
Alternatives and similar repositories for mbleven:
Users who are interested in mbleven compare it to the libraries listed below:
- brat rapid annotation tool (brat) - for all your textual annotation needs ☆10 · Updated 7 years ago
- Supporting example for "A Rust SentencePiece implementation" ☆18 · Updated 4 years ago
- Anytime Ranking for Impact-Ordered Indexes ☆12 · Updated 8 years ago
- Deep learning spelling patterns with a recurrent neural network ☆12 · Updated 7 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning ☆41 · Updated 4 years ago
- Code and data from the paper "Email formality in the workplace: A case study on the Enron corpus" ☆10 · Updated 9 years ago
- Successor to Annoy https://github.com/spotify/annoy ☆13 · Updated 9 years ago
- allennlp + streamlit demo ☆22 · Updated 5 years ago
- Simplifying parsing of large jsonline files in NLP Workflows ☆12 · Updated 3 years ago
- Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms ☆14 · Updated 2 years ago
- A DeepWalk implementation for ontologies using NetworkX and Gensim ☆18 · Updated 7 years ago
- A python library to generate highly realistic typos (fuzz-testing) ☆11 · Updated 6 years ago
- Python package to compute metrics on an NLU intent parsing pipeline ☆13 · Updated 4 years ago
- A C++ library implementing fast language models estimation using the 1-Sort algorithm. ☆17 · Updated last year
- A workflow system for Natural Language Processing. ☆21 · Updated 5 years ago
- Playing with arithmetic coding and RNNs ☆22 · Updated 8 years ago
- Short Text Similarity as described in https://dl.acm.org/citation.cfm?id=2806475 ☆16 · Updated 6 years ago
- Ranking Entity Types using the Web of Data ☆30 · Updated 8 years ago
- Memory-efficient Count-Min Sketch Counter (based on Madoka C++ library) ☆26 · Updated 6 years ago
- ☆30 · Updated 2 years ago
- Hidden alignment conditional random field for classifying string pairs. ☆24 · Updated 4 months ago
- Neural Solr = Solr 9 + Mighty Inference + Node ☆16 · Updated 2 years ago
- An open-source NLP library: fast text cleaning and preprocessing ☆23 · Updated 3 years ago
- Highly specialized crate to parse and use `google/sentencepiece`'s precompiled_charsmap in `tokenizers` ☆18 · Updated 2 years ago
- VW, Liblinear and StreamSVM compared on webspam ☆14 · Updated 10 years ago
- Text readability metrics in Python. ☆11 · Updated 11 years ago
- Transformer based Trigram Blocking implementation in Tensorflow ☆11 · Updated 4 years ago
- ☆9 · Updated 8 years ago
- Neural Elastic Inference and Search ☆19 · Updated 5 years ago
- A tool for detecting sentence fragments. ☆7 · Updated 8 years ago