zoho-labs / symspellLinks
Rust python bindings for symspell
☆19Updated last year
Alternatives and similar repositories for symspell
Users that are interested in symspell are comparing it to the libraries listed below
Sorting:
- ☆30Updated 3 years ago
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- Execute arbitrary SQL queries on 🤗 Datasets☆32Updated last year
- Using queues, tqdm-multiprocess supports multiple worker processes, each with multiple tqdm progress bars, displaying them cleanly throug…☆43Updated 4 years ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 3 years ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆33Updated last year
- ☆17Updated 2 years ago
- Library for fast text representation and classification.☆30Updated last year
- A Streamlit component for annotating text by text selecting.☆40Updated last year
- Align the token outputs from Spacy and Huggingface to help understand what language structures transformers see☆44Updated 3 years ago
- Custom Natural Language Processing with big and small models 🌲🌱☆68Updated 3 years ago
- Topic Inference with Zeroshot models☆61Updated 2 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆87Updated 3 months ago
- Dataiku DSS plugin to detect languages, correct misspellings, and clean text data 🧼☆22Updated 5 months ago
- Finds linguistic patterns effortlessly☆37Updated last year
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- Source code and data for Like a Good Nearest Neighbor☆29Updated 6 months ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.☆21Updated last year
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.☆26Updated 4 years ago
- Annotation Management for Prodigy, that support multiple users working in many projects☆15Updated 6 years ago
- Generate reports for spaCy models.☆29Updated 3 years ago
- ☆55Updated last year
- Agents that build knowledge graphs and explore textual worlds by asking questions☆79Updated last year
- MinHash implementation in Python☆11Updated 10 months ago
- Multi-Langauge Identification☆28Updated 11 months ago
- ☆69Updated 3 years ago
- Code for SaGe subword tokenizer (EACL 2023)☆25Updated 7 months ago
- Data Programming by Demonstration (DPBD) for Document Classification☆35Updated 4 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Updated 3 years ago
- ☆19Updated 3 years ago