zoho-labs / symspellLinks
Rust python bindings for symspell
☆19Updated last year
Alternatives and similar repositories for symspell
Users that are interested in symspell are comparing it to the libraries listed below
Sorting:
- ☆30Updated 3 years ago
- Library for fast text representation and classification.☆30Updated last year
- KenLM extension for spaCy 2.0.☆16Updated 7 years ago
- A set of methods for finding an appropriate number of topics in a text collection☆16Updated 2 months ago
- Data Programming by Demonstration (DPBD) for Document Classification☆35Updated 4 years ago
- ☆17Updated 2 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Updated 3 years ago
- Align the token outputs from Spacy and Huggingface to help understand what language structures transformers see☆44Updated 3 years ago
- Source code and data for Like a Good Nearest Neighbor☆29Updated 5 months ago
- Fast fuzzy text search☆11Updated 2 years ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆33Updated last year
- Finds linguistic patterns effortlessly☆36Updated last year
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 3 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- MinHash implementation in Python☆11Updated 10 months ago
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- ☆22Updated 3 years ago
- GrammarTagger — A Neural Multilingual Grammar Profiler for Language Learning☆27Updated 4 years ago
- Language model powered proof reader for correcting contextual errors in natural language.☆24Updated last year
- Execute arbitrary SQL queries on 🤗 Datasets☆32Updated last year
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.☆21Updated last year
- 🌸 Train floret vectors☆18Updated 2 years ago
- Generate BERT vocabularies and pretraining examples from Wikipedias☆17Updated 5 years ago
- MoodCat😼 classifies the mood of English sentences.☆14Updated 3 years ago
- Large-scale query-focused multi-document Summarization dataset☆10Updated 3 years ago
- A python library to generate highly realistic typos (fuzz-testing)☆11Updated 3 months ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- ☆8Updated 11 months ago
- BERT models for many languages created from Wikipedia texts☆33Updated 5 years ago
- Multi-Langauge Identification☆28Updated 11 months ago