mhagiwara / xfspell
xfspell — the Transformer Spell Checker
☆189Updated 4 years ago
Alternatives and similar repositories for xfspell:
Users that are interested in xfspell are comparing it to the libraries listed below
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆126Updated 4 years ago
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆160Updated 5 months ago
- LASER multilingual sentence embeddings as a pip package☆224Updated last year
- Automatic extraction of edited sentences from text edition histories.☆82Updated 3 years ago
- A Benchmark Dataset for Understanding Disfluencies in Question Answering☆62Updated 3 years ago
- 📃Language Model based sentences scoring library☆307Updated 3 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆135Updated last year
- Fast and accurate spell correction library☆81Updated 3 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆188Updated 3 years ago
- A python true casing utility that restores case information for texts☆88Updated 2 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆119Updated 3 years ago
- Use Language Model (LM) for Grammar Error Correction (GEC), without the use of annotated data.☆83Updated 5 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆114Updated 5 years ago
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of …☆61Updated 4 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 2 years ago
- Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models …☆231Updated last year
- Build a dialog dataset from online books in many languages☆72Updated 2 years ago
- LM Pretraining with PyTorch/TPU☆134Updated 5 years ago
- Punctuation restoration and spell correction experiments.☆251Updated 4 years ago
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆52Updated 4 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆155Updated 9 months ago
- Easily fine tune GPT-2 to fill in missing text☆198Updated 2 years ago
- Create interactive textual heat maps for Jupiter notebooks☆196Updated 9 months ago
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+☆37Updated 4 years ago
- Easier Automatic Sentence Simplification Evaluation☆160Updated last year
- Implementation of the GBST block from the Charformer paper, in Pytorch☆116Updated 3 years ago
- ☆72Updated 6 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆146Updated 3 years ago
- Dual Encoders for State-of-the-art Natural Language Processing.☆61Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆151Updated 9 months ago