mammothb / editdistpyLinks
Fast edit distance Python extension written in Cython/C++. Supports Levenshtein distance and Damerau Optimal String Alignment (OSA) distance.
☆25Updated 8 months ago
Alternatives and similar repositories for editdistpy
Users that are interested in editdistpy are comparing it to the libraries listed below
Sorting:
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆21Updated 3 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆87Updated last month
- A simple neural truecaser written in pytorch and allennlp.☆33Updated last year
- Rust python bindings for symspell☆21Updated 2 years ago
- Find strings/words in text; convenience and C speed☆126Updated 3 years ago
- Execute arbitrary SQL queries on 🤗 Datasets☆32Updated 2 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆33Updated 3 years ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 4 years ago
- Implementation of Z-BERT-A: a zero-shot pipeline for unknown intent detection.☆44Updated 2 years ago
- ☆30Updated 3 years ago
- Tower Parse: Low-Resource Dependency Parsing via Hierarchical Source Selection☆15Updated 4 years ago
- ☆18Updated 2 years ago
- Multi-Langauge Identification☆28Updated last year
- PyTorch-IE: State-of-the-art Information Extraction in PyTorch☆77Updated 4 months ago
- ☆43Updated 2 years ago
- zero-vocab or low-vocab embeddings☆18Updated 3 years ago
- Prebuilt .whl files for MacOS + Linux of the Facebook FAISS library☆57Updated 3 years ago
- Source code for the Apple reproduction☆33Updated 4 years ago
- xfspell — the Transformer Spell Checker☆189Updated 5 years ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆76Updated 2 weeks ago
- ☆28Updated 2 years ago
- fastlangid, the only language identification package that support cantonese (zh-yue), simplified (zh-hans) and traditional chinese (zh-ha…☆43Updated 3 years ago
- A python package to simulate typographical errors.☆38Updated 2 years ago
- spaCy match and replace, maintaining conjugation☆36Updated 3 years ago
- Library for fast text representation and classification.☆31Updated 2 years ago
- Combining encoder-based language models☆11Updated 4 years ago
- Crawling engine that crawls a set of top-level domains looking for documents in a list of languages☆11Updated 2 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated last year
- Open source library for few shot NLP☆78Updated 2 years ago
- Experiments with Hugging Face 🔬 🤗☆46Updated last year