bakwc / JamSpell
Modern spell checking library - accurate, fast, multi-language
☆634Updated 7 months ago
Alternatives and similar repositories for JamSpell:
Users that are interested in JamSpell are comparing it to the libraries listed below
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆824Updated 3 weeks ago
- ✔️Contextual word checker for better suggestions (not actively maintained)☆414Updated 2 months ago
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/☆738Updated last month
- NeuSpell: A Neural Spelling Correction Toolkit☆691Updated last year
- A sentence segmenter that actually works!☆305Updated 4 years ago
- GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors☆505Updated 5 years ago
- xfspell — the Transformer Spell Checker☆190Updated 4 years ago
- Tools for shrinking fastText models (in gensim format)☆178Updated 11 months ago
- Unsupervised text tokenizer focused on computational efficiency☆965Updated last year
- Fast topic modeling platform☆669Updated last year
- LASER multilingual sentence embeddings as a pip package☆223Updated last year
- Text tokenization and sentence segmentation (segtok v2)☆201Updated 3 years ago
- Fast and customizable text tokenization library with BPE and SentencePiece support☆302Updated 7 months ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆244Updated 2 years ago
- A python module for English lemmatization and inflection.☆267Updated last year
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)☆1,211Updated 6 months ago
- Models for automatic abstractive summarization☆171Updated 2 years ago
- Punctuation restoration and spell correction experiments.☆252Updated 4 years ago
- Bitextor generates translation memories from multilingual websites☆292Updated 5 months ago
- Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagg…☆919Updated 10 months ago
- Python port of Moses tokenizer, truecaser and normalizer☆493Updated 10 months ago
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.☆443Updated last year
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.☆316Updated last month
- ☆36Updated 2 years ago
- A list of pretrained Transformer models for the Russian language.☆174Updated 5 years ago
- 🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.☆845Updated 7 months ago
- Morphological analyzer for Russian and English languages based on neural networks and dictionary-lookup systems.☆153Updated 10 months ago
- ☆818Updated last year
- Natural Language Processing Pipeline - Sentence Splitting, Tokenization, Lemmatization, Part-of-speech Tagging and Dependency Parsing☆559Updated 5 months ago
- Fixes contractions such as `you're` to `you are`☆317Updated 2 years ago