zoho-labs / symspell
Rust python bindings for symspell
☆18Updated last year
Alternatives and similar repositories for symspell:
Users that are interested in symspell are comparing it to the libraries listed below
- ☆30Updated 2 years ago
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- ☆17Updated last year
- Library for fast text representation and classification.☆28Updated last year
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated last month
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- KenLM extension for spaCy 2.0.☆16Updated 7 years ago
- spaCy entry points for Curated Transformers☆26Updated 4 months ago
- Annotation Management for Prodigy, that support multiple users working in many projects☆15Updated 6 years ago
- Data Programming by Demonstration (DPBD) for Document Classification☆35Updated 3 years ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.☆21Updated 9 months ago
- Fast fuzzy text search☆11Updated last year
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆32Updated 8 months ago
- My NER Experiments with ModernBERT☆17Updated last month
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 3 years ago
- sequence tagging with spaCy and crfsuite☆19Updated last year
- Align the token outputs from Spacy and Huggingface to help understand what language structures transformers see☆44Updated 2 years ago
- Execute arbitrary SQL queries on 🤗 Datasets☆32Updated last year
- Official details for: [1803.08493] Context is Everything: Finding Meaning Statistically in Semantic Spaces☆39Updated 5 years ago
- Finds linguistic patterns effortlessly☆35Updated last year
- ☆42Updated last year
- Generate reports for spaCy models.☆29Updated 2 years ago
- 🌸 Train floret vectors☆18Updated last year
- Fast edit distance Python extension written in Cython/C++. Supports Levenshtein distance and Damerau Optimal String Alignment (OSA) dista…☆23Updated 5 months ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Updated 2 years ago
- Learning BPE embeddings by first learning a segmentation model and then training word2vec☆19Updated 2 years ago
- Generate BERT vocabularies and pretraining examples from Wikipedias☆18Updated 4 years ago
- Highly concurrent and fast content processing for Mighty Inference Server☆10Updated 2 years ago
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.☆26Updated 3 years ago