viig99 / SymSpellCppPy
Fast SymSpell written in c++ and exposes to python via pybind11
☆39Updated last year
Related projects ⓘ
Alternatives and complementary repositories for SymSpellCppPy
- xfspell — the Transformer Spell Checker☆187Updated 4 years ago
- Robust Cross-lingual Embeddings from Parallel Sentences☆20Updated 4 years ago
- ☆134Updated 8 months ago
- Fast and accurate spell correction library☆77Updated 2 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆46Updated 3 years ago
- NanigoNet — Language detector for code-mixed input supporting 150+19 human+programming languages using deep neural networks☆71Updated last year
- GrammarTagger — A Neural Multilingual Grammar Profiler for Language Learning☆27Updated 3 years ago
- Indian Language Tagger and Chunker (Hindi, Telugu, Tamil, Marathi, Punjabi, Kanada, Malayalam, Urdu, Bengali)☆40Updated last year
- C++ wrapper library for the NLP library spaCy☆99Updated last year
- A simple neural truecaser written in pytorch and allennlp.☆32Updated 5 months ago
- A web application that interfaces two GEC systems. [web instance is down]☆31Updated 3 months ago
- Automatic extraction of edited sentences from text edition histories.☆81Updated 2 years ago
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of …☆61Updated 4 years ago
- NTREX -- News Test References for MT Evaluation☆75Updated 5 months ago
- ☆11Updated 3 years ago
- Post-processing OCR errors with seq2seq models☆28Updated 4 years ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 3 years ago
- Extracts plain text, language identification and more metadata from WARC records☆20Updated 3 months ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆101Updated 2 years ago
- Official source code repository for QueryBlazer: Efficient Query Autocompletion Framework☆19Updated 3 years ago
- OpusFilter - Parallel corpus processing toolkit☆102Updated 3 months ago
- OpenNeuroSpell contains parts of NeuroSpell (http://neurospell.com/en.php) released as open-source. More code will be published as soon a…☆20Updated 3 weeks ago
- A tool that locates, downloads, and extracts machine translation corpora☆147Updated 5 months ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆49Updated 4 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated last year
- Tooling to play around with multilingual machine translation for Indian Languages.