zoho-labs / symspell
Rust Python bindings for SymSpell
☆18Updated 10 months ago
Related projects
Alternatives and complementary repositories for symspell
- Large-scale query-focused multi-document Summarization dataset☆10Updated 3 years ago
- Library for fast text representation and classification.☆28Updated 10 months ago
- An extension package of 🤗 Datasets that provides support for executing arbitrary SQL queries on HF datasets☆31Updated 10 months ago
- Align the token outputs from spaCy and Hugging Face to help understand what language structures transformers see☆44Updated 2 years ago
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆43Updated 6 months ago
- ☆29Updated 2 years ago
- A file utility for accessing both local and remote files through a unified interface.☆36Updated this week
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.☆21Updated 6 months ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 2 years ago
- Tooling to play around with multilingual machine translation for Indian Languages.☆21Updated 2 years ago
- spaCy match and replace, maintaining conjugation☆34Updated last year
- Fastlaw's purpose is to replace generic word embeddings in supervised machine learning NLP tasks on legal texts.☆37Updated 5 years ago
- Annotation management for Prodigy that supports multiple users working on many projects☆15Updated 6 years ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆31Updated 6 months ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated last year
- KenLM extension for spaCy 2.0.☆16Updated 6 years ago
- Code for SaGe subword tokenizer (EACL 2023)☆22Updated this week
- An experimental implementation of Burrows' Delta in Python 3☆20Updated 3 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- MoodCat😼 classifies the mood of English sentences.☆14Updated 2 years ago
- Fast fuzzy text search☆11Updated last year
- Source code and data for Like a Good Nearest Neighbor☆28Updated 9 months ago
- spaCy entry points for Curated Transformers☆25Updated last month
- Language detection using spaCy and fastText☆54Updated 11 months ago
- Multi-Language Identification☆28Updated 3 months ago
- ISO 639 language codes☆37Updated last month
- Using queues, tqdm-multiprocess supports multiple worker processes, each with multiple tqdm progress bars, displaying them cleanly throug…☆41Updated 3 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Updated 2 years ago
- ☆17Updated last year
- ☆15Updated 3 years ago