bakwc / JamSpell
Modern spell checking library - accurate, fast, multi-language
☆605 Updated 3 weeks ago
Related projects:
- NeuSpell: A Neural Spelling Correction Toolkit ☆662 Updated last year
- Unsupervised text tokenizer focused on computational efficiency ☆953 Updated 5 months ago
- ✔️ Contextual word checker for better suggestions ☆405 Updated 6 months ago
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/ ☆696 Updated 6 months ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm… ☆791 Updated 2 weeks ago
- ☆774 Updated last year
- GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors ☆482 Updated 4 years ago
- A sentence segmenter that actually works! ☆303 Updated 4 years ago
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary. ☆308 Updated this week
- LASER multilingual sentence embeddings as a pip package ☆224 Updated last year
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files ☆358 Updated last week
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE) ☆1,177 Updated 6 months ago
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy ☆724 Updated last month
- 🐍💯 pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box. ☆782 Updated last month
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency ☆139 Updated last month
- Fast Neural Machine Translation in C++ ☆1,225 Updated last year
- Bitextor generates translation memories from multilingual websites ☆287 Updated 3 months ago
- LexRank algorithm for text summarization ☆229 Updated 5 months ago
- Simple, fast unsupervised word aligner ☆732 Updated 2 years ago
- Text tokenization and sentence segmentation (segtok v2) ☆200 Updated 2 years ago
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python ☆268 Updated last year
- C++ wrapper library for the NLP library spaCy ☆99 Updated last year
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder. ☆225 Updated last year
- Applying BERT to named entity recognition in English and Russian. ☆159 Updated last year
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences. ☆432 Updated 5 months ago
- A python module for English lemmatization and inflection. ☆258 Updated last year
- Models for automatic abstractive summarization ☆170 Updated 2 years ago
- Named Entity Recognition ☆329 Updated last year
- Python port of Moses tokenizer, truecaser and normalizer ☆486 Updated 3 months ago
- Spelling corrector in python ☆449 Updated 9 months ago