mhagiwara / xfspell
xfspell — the Transformer Spell Checker
☆190 · Updated 4 years ago
Alternatives and similar repositories for xfspell:
Users who are interested in xfspell are comparing it to the repositories listed below.
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime. ☆126 · Updated 4 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models". ☆188 · Updated 3 years ago
- Stanford's Alexa Prize socialbot ☆133 · Updated last year
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro… ☆161 · Updated 7 months ago
- Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models … ☆231 · Updated 2 years ago
- Automatic extraction of edited sentences from text edition histories. ☆83 · Updated 3 years ago
- A python true casing utility that restores case information for texts ☆88 · Updated 2 years ago
- Code to reproduce the experiments from the paper. ☆102 · Updated last year
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale ☆154 · Updated last year
- A sentence segmenter that actually works! ☆306 · Updated 4 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks ☆195 · Updated 2 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction ☆119 · Updated 3 years ago
- LASER multilingual sentence embeddings as a pip package ☆223 · Updated last year
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020… ☆33 · Updated 4 years ago
- Use Language Model (LM) for Grammar Error Correction (GEC), without the use of annotated data. ☆84 · Updated 5 years ago
- Build a dialog dataset from online books in many languages ☆73 · Updated 2 years ago
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of … ☆61 · Updated 4 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus. ☆157 · Updated 10 months ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2 ☆114 · Updated 5 years ago
- New dataset ☆304 · Updated 3 years ago
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology… ☆223 · Updated 2 years ago
- 📃Language Model based sentences scoring library ☆308 · Updated 3 years ago
- A Benchmark Dataset for Understanding Disfluencies in Question Answering ☆62 · Updated 3 years ago
- A Corpus for Multilingual Document Classification in Eight Languages. ☆151 · Updated 2 years ago
- Easier Automatic Sentence Simplification Evaluation ☆160 · Updated last year
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis. ☆147 · Updated 3 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT) ☆360 · Updated last year
- A benchmark for code-switched NLP, ACL 2020 ☆74 · Updated 11 months ago
- Viewer for the 🤗 datasets library. ☆84 · Updated 3 years ago
- Python library & examples for Masked Language Model Scoring (ACL 2020) ☆342 · Updated 2 years ago