awslabs / mlm-scoring
Python library & examples for Masked Language Model Scoring (ACL 2020)
☆342Updated 2 years ago
Alternatives and similar repositories for mlm-scoring:
Users that are interested in mlm-scoring are comparing it to the libraries listed below
- Easily fine tune GPT-2 to fill in missing text☆199Updated 2 years ago
- A neural word aligner based on multilingual BERT☆345Updated 3 years ago
- Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models …☆231Updated 2 years ago
- A tool for holistic analysis of language generations systems☆468Updated 3 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆119Updated 3 years ago
- Yet Another Neural Machine Translation Toolkit☆178Updated 3 weeks ago
- Easier Automatic Sentence Simplification Evaluation☆160Updated last year
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)☆327Updated last year
- Fast BPE☆668Updated 9 months ago
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance☆205Updated last year
- Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"☆200Updated last year
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆361Updated last year
- Repository to collect and categorize Grammatical Error Correction papers.☆116Updated 5 months ago
- ☆362Updated 2 years ago
- Source code for paper: Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data☆250Updated 4 years ago
- Neural Text Generation with Unlikelihood Training☆309Updated 3 years ago
- Code associated with the Don't Stop Pretraining ACL 2020 paper☆529Updated 3 years ago
- A masked language modeling objective to train a model to predict any subset of the target words, conditioned on both the input text and a…☆242Updated 3 years ago
- ☆318Updated 3 years ago
- New dataset☆303Updated 3 years ago
- cLang-8 is a dataset for grammatical error correction.☆103Updated 2 years ago
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…☆642Updated 2 years ago
- ☆119Updated 4 years ago
- ☆190Updated 3 years ago
- A tool that locates, downloads, and extracts machine translation corpora☆154Updated this week
- Interpretable Evaluation for (Almost) All NLP Tasks☆195Updated 2 years ago
- DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue☆282Updated last year
- MT Evaluation in Many Languages via Zero-Shot Paraphrasing☆101Updated 8 months ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …☆301Updated 4 years ago
- Tracking the progress in non-autoregressive generation (translation, transcription, etc.)☆307Updated 2 years ago