mammothb / editdistpyLinks
Fast edit distance Python extension written in Cython/C++. Supports Levenshtein distance and Damerau Optimal String Alignment (OSA) distance.
☆24Updated 5 months ago
Alternatives and similar repositories for editdistpy
Users that are interested in editdistpy are comparing it to the libraries listed below
Sorting:
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆21Updated 2 years ago
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f…☆24Updated 4 years ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.☆21Updated last year
- Multi-Langauge Identification☆28Updated last year
- Find strings/words in text; convenience and C speed☆127Updated 3 years ago
- State-of-the-art NLP through transformer models in a modular design and consistent APIs.☆46Updated 2 years ago
- Rust python bindings for symspell☆21Updated last year
- Custom Natural Language Processing with big and small models 🌲🌱☆66Updated 4 years ago
- Sentence transformers models for SpaCy☆109Updated 2 years ago
- ☆30Updated 3 years ago
- Topic Inference with Zeroshot models☆61Updated 2 years ago
- A python module for word inflections designed for use with spaCy.☆93Updated 5 years ago
- fastlangid, the only language identification package that support cantonese (zh-yue), simplified (zh-hans) and traditional chinese (zh-ha…☆41Updated 2 years ago
- ☆43Updated 2 years ago
- A python package to simulate typographical errors.☆38Updated last year
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆33Updated 3 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated 11 months ago
- A simple neural truecaser written in pytorch and allennlp.☆33Updated last year
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆64Updated 2 years ago
- Execute arbitrary SQL queries on 🤗 Datasets☆32Updated last year
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 4 years ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe…☆68Updated 3 years ago
- Rust-based Python wrapper for duckling library in Haskell☆25Updated 4 years ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆154Updated 2 years ago
- Implementation of Z-BERT-A: a zero-shot pipeline for unknown intent detection.☆44Updated 2 years ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).☆76Updated 3 weeks ago
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models.☆25Updated 4 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆87Updated last month
- spaCy match and replace, maintaining conjugation☆36Updated 2 years ago
- A utility for labeling clusters of text data.☆28Updated 4 years ago