mhagiwara / xfspellLinks
xfspell — the Transformer Spell Checker
☆190Updated 4 years ago
Alternatives and similar repositories for xfspell
Users that are interested in xfspell are comparing it to the libraries listed below
Sorting:
- LASER multilingual sentence embeddings as a pip package☆223Updated last year
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆126Updated 4 years ago
- Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models …☆230Updated 2 years ago
- Segment documents into coherent parts using word embeddings.☆148Updated 3 years ago
- Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.☆253Updated 2 years ago
- Automatic extraction of edited sentences from text edition histories.☆83Updated 3 years ago
- Build a dialog dataset from online books in many languages☆73Updated 2 years ago
- Use Language Model (LM) for Grammar Error Correction (GEC), without the use of annotated data.☆83Updated 5 years ago
- A python true casing utility that restores case information for texts☆88Updated 2 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks☆195Updated 2 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆157Updated 11 months ago
- Fast and accurate spell correction library☆81Updated 3 years ago
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of …☆60Updated 4 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆361Updated last year
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆119Updated 3 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆155Updated last year
- A sentence segmenter that actually works!☆306Updated 4 years ago
- Misspelling Oblivious Word Embeddings☆201Updated 5 years ago
- Create interactive textual heat maps for Jupiter notebooks☆196Updated last year
- Stanford's Alexa Prize socialbot☆133Updated last year
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆188Updated 3 years ago
- Preprocessing Library for Natural Language Processing☆163Updated 2 years ago
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆161Updated 8 months ago
- A Corpus for Multilingual Document Classification in Eight Languages.☆151Updated 2 years ago
- One million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.☆123Updated 6 years ago
- ☆72Updated 7 years ago
- Punctuation restoration and spell correction experiments.☆251Updated 4 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 2 years ago
- State of the art faster Transformer with Tensorflow 2.0 ( NLP, Computer Vision, Audio ).☆85Updated 2 years ago
- ⛔ [NOT MAINTAINED] A web-based annotator for closed-domain question answering datasets with SQuAD format.☆88Updated 2 years ago