mammothb / editdistpyLinks
Fast edit distance Python extension written in Cython/C++. Supports Levenshtein distance and Damerau Optimal String Alignment (OSA) distance.
☆24Updated 4 months ago
Alternatives and similar repositories for editdistpy
Users that are interested in editdistpy are comparing it to the libraries listed below
Sorting:
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- Rust python bindings for symspell☆21Updated last year
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated 10 months ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 4 years ago
- Find strings/words in text; convenience and C speed☆127Updated 3 years ago
- Implementation of Z-BERT-A: a zero-shot pipeline for unknown intent detection.☆43Updated 2 years ago
- ☆87Updated 3 years ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.☆21Updated last year
- Open source library for few shot NLP☆79Updated 2 years ago
- State-of-the-art NLP through transformer models in a modular design and consistent APIs.☆46Updated 2 years ago
- Multi-Langauge Identification☆28Updated last year
- Library for fast text representation and classification.☆31Updated last year
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆33Updated 3 years ago
- Execute arbitrary SQL queries on 🤗 Datasets☆32Updated last year
- A simple neural truecaser written in pytorch and allennlp.☆33Updated last year
- ☆28Updated 2 years ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).☆73Updated last year
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆45Updated last year
- Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188)☆61Updated 2 years ago
- xfspell — the Transformer Spell Checker☆189Updated 5 years ago
- ☆43Updated 2 years ago
- ☆30Updated 3 years ago
- ☆17Updated 2 years ago
- Tower Parse: Low-Resource Dependency Parsing via Hierarchical Source Selection☆15Updated 4 years ago
- A python package to simulate typographical errors.☆37Updated last year
- fastlangid, the only language identification package that support cantonese (zh-yue), simplified (zh-hans) and traditional chinese (zh-ha…☆41Updated 2 years ago
- super fast cpp implementation of longest common subsequence/substring☆72Updated last year
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f…☆24Updated 4 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 3 years ago
- zero-vocab or low-vocab embeddings☆18Updated 3 years ago