zoho-labs / symspell
Rust Python bindings for symspell
☆19Updated last year
Alternatives and similar repositories for symspell:
Users who are interested in symspell are comparing it to the libraries listed below
- ☆30Updated 2 years ago
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- ☆17Updated last year
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆32Updated 10 months ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 3 years ago
- Multi-Language Identification☆29Updated 8 months ago
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models.☆25Updated 3 years ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated 2 months ago
- Execute arbitrary SQL queries on 🤗 Datasets☆32Updated last year
- Using queues, tqdm-multiprocess supports multiple worker processes, each with multiple tqdm progress bars, displaying them cleanly throug…☆42Updated 4 years ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.☆21Updated 11 months ago
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f…☆24Updated 4 years ago
- A Python library for creating adversarial splits☆13Updated 2 years ago
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.☆27Updated 3 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 4 years ago
- Library for fast text representation and classification.☆28Updated last year
- Large-scale query-focused multi-document Summarization dataset☆10Updated 3 years ago
- Data Programming by Demonstration (DPBD) for Document Classification☆35Updated 3 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Updated 2 years ago
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe…☆18Updated 2 years ago
- A set of methods for finding an appropriate number of topics in a text collection☆15Updated last week
- Topic Inference with Zeroshot models☆61Updated last year
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆37Updated 5 years ago
- ☆22Updated 2 years ago
- Language model powered proof reader for correcting contextual errors in natural language.☆24Updated last year
- Learning BPE embeddings by first learning a segmentation model and then training word2vec☆19Updated 2 years ago
- Finds linguistic patterns effortlessly☆35Updated last year
- ☆43Updated last year