zoho-labs / symspellLinks
Rust python bindings for symspell
☆21Updated last year
Alternatives and similar repositories for symspell
Users that are interested in symspell are comparing it to the libraries listed below
Sorting:
- ☆30Updated 3 years ago
- spaCy match and replace, maintaining conjugation☆36Updated 3 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆21Updated 2 years ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆35Updated last year
- Fast fuzzy text search☆11Updated 2 years ago
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.☆27Updated 4 years ago
- ☆17Updated 2 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆87Updated 2 months ago
- Library for fast text representation and classification.☆31Updated last year
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.☆21Updated last year
- 🐍 Python bidding for the Hora Approximate Nearest Neighbor Search Algorithm library☆73Updated 4 years ago
- Align the token outputs from Spacy and Huggingface to help understand what language structures transformers see☆44Updated 3 years ago
- Fast edit distance Python extension written in Cython/C++. Supports Levenshtein distance and Damerau Optimal String Alignment (OSA) dista…☆25Updated 6 months ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 3 years ago
- Multi-Langauge Identification☆28Updated last year
- Source code and data for Like a Good Nearest Neighbor☆30Updated 11 months ago
- Annotation Management for Prodigy, that support multiple users working in many projects☆15Updated 7 years ago
- KenLM extension for spaCy 2.0.☆16Updated 8 years ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 4 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Updated 3 years ago
- Sentence transformers models for SpaCy☆109Updated 2 years ago
- A Streamlit component for annotating text by text selecting.☆42Updated last year
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated last year
- Finds linguistic patterns effortlessly☆39Updated 2 years ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆24Updated 3 weeks ago
- Code for SaGe subword tokenizer (EACL 2023)☆27Updated last year
- Custom Natural Language Processing with big and small models 🌲🌱☆66Updated 4 years ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently…☆108Updated last year
- A python module for word inflections designed for use with spaCy.☆93Updated 5 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Updated 2 years ago