fujimotos / mbleven
An efficient algorithm for k-bounded (Damerau-)Levenshtein distance
☆16Updated 6 years ago
Alternatives and similar repositories for mbleven:
Users that are interested in mbleven are comparing it to the libraries listed below
- Code and data from the paper "Email formality in the workplace: A case study on the Enron corpus"☆10Updated 9 years ago
- Deep learning spelling patterns with a recurrent neural network☆12Updated 7 years ago
- A C++ library implementing fast language models estimation using the 1-Sort algorithm.☆17Updated last year
- Official library of images for the SIGIR 2019 Open-Source IR Replicability Challenge (OSIRRC 2019)☆13Updated 5 years ago
- brat rapid annotation tool (brat) - for all your textual annotation needs☆10Updated 7 years ago
- Anytime Ranking for Impact-Ordered Indexes☆13Updated 8 years ago
- Neural Elastic Inference and Search☆19Updated 5 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- ☆30Updated 2 years ago
- Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms☆14Updated 2 years ago
- Official repository of Quickscorer: a fast algorithm to rank documents with additive ensembles of regression trees.☆18Updated 8 years ago
- A tool for learning significant phrase/term models, and efficiently labeling with them.☆33Updated last week
- allennlp + streamlit demo☆22Updated 5 years ago
- Memory-efficient Count-Min Sketch Counter (based on Madoka C++ library)☆26Updated 6 years ago
- A tool for detecting sentence fragments.☆7Updated 8 years ago
- A DeepWalk implementation for ontologies using NetworkX and Gensim☆18Updated 7 years ago
- Yet another regression toolkit☆12Updated 11 years ago
- Query Segmentation for search☆20Updated 4 years ago
- framework for making streamcorpus data☆11Updated 8 years ago
- Simplifying parsing of large jsonline files in NLP Workflows☆12Updated 3 years ago
- Hidden alignment conditional random field for classifying string pairs.☆24Updated 7 months ago
- A toolkit for generating paraphrase vector representations for words in context☆23Updated 9 years ago
- A pure Python implementation of Aho-Corasick algorithm.☆22Updated 6 years ago
- Playing with arithmetic coding and RNNs☆22Updated 8 years ago
- Python bindings for MetroHash☆19Updated last month
- ☆10Updated 9 years ago
- A dataset of popular pages (taken from <dir.yahoo.com>) with manually marked up semantic blocks.☆15Updated 11 years ago
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow☆13Updated 8 years ago
- Neural Solr = Solr 9 + Mighty Inference + Node☆17Updated 2 years ago
- Dynamic Entity Summarization (DynES)☆20Updated 5 years ago