awslabs / mlm-scoring
Python library & examples for Masked Language Model Scoring (ACL 2020)
☆340 Updated 2 years ago
Alternatives and similar repositories for mlm-scoring:
Users who are interested in mlm-scoring compare it to the libraries listed below.
- A neural word aligner based on multilingual BERT ☆338 Updated 2 years ago
- ☆361 Updated 2 years ago
- Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models … ☆231 Updated last year
- Repository to collect and categorize Grammatical Error Correction papers. ☆116 Updated 4 months ago
- Fast BPE ☆666 Updated 8 months ago
- Yet Another Neural Machine Translation Toolkit ☆177 Updated 8 months ago
- A tool for holistic analysis of language generation systems ☆467 Updated 2 years ago
- The Benchmark of Linguistic Minimal Pairs ☆148 Updated 2 years ago
- A masked language modeling objective to train a model to predict any subset of the target words, conditioned on both the input text and a… ☆242 Updated 3 years ago
- Easier Automatic Sentence Simplification Evaluation ☆160 Updated last year
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated!) ☆326 Updated last year
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance ☆203 Updated last year
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT) ☆357 Updated last year
- ☆187 Updated 3 years ago
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology… ☆221 Updated 2 years ago
- Tracking the progress in non-autoregressive generation (translation, transcription, etc.) ☆307 Updated last year
- Pre-Trained Models for ToD-BERT ☆291 Updated last year
- Improved Sentence Alignment in Linear Time and Space ☆167 Updated last year
- A tool that locates, downloads, and extracts machine translation corpora ☆150 Updated last week
- Interpretable Evaluation for (Almost) All NLP Tasks ☆195 Updated 2 years ago
- ☆119 Updated 4 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus. ☆154 Updated 8 months ago
- The official tool for creating proceedings for conferences of the Association for Computational Linguistics (ACL). ☆222 Updated last month
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction ☆119 Updated 3 years ago
- Utilities for Processing the Switchboard Dialogue Act Corpus ☆68 Updated 4 years ago
- Neural Text Generation with Unlikelihood Training ☆309 Updated 3 years ago
- Code to reproduce the experiments from the paper. ☆101 Updated last year
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c… ☆362 Updated 3 years ago
- ☆461 Updated 3 years ago
- Code for ACL 2020 paper: "Extractive Summarization as Text Matching" ☆520 Updated 3 years ago