Python library & examples for Masked Language Model Scoring (ACL 2020)
☆350Dec 20, 2022Updated 3 years ago
Alternatives and similar repositories for mlm-scoring
Users that are interested in mlm-scoring are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Jun 10, 2021Updated 5 years ago
- The Benchmark of Linguistic Minimal Pairs☆168Dec 13, 2022Updated 3 years ago
- 📃Language Model based sentences scoring library☆311Jun 8, 2026Updated 3 weeks ago
- BERT score for text generation☆1,904Jul 30, 2024Updated last year
- LAnguage Model Analysis☆1,390Jul 7, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- EMNLP 2021 - CTC: A Unified Framework for Evaluating Natural Language Generation☆97Mar 20, 2023Updated 3 years ago
- Code associated with the Don't Stop Pretraining ACL 2020 paper☆543Nov 15, 2021Updated 4 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆104Nov 26, 2022Updated 3 years ago
- BLEURT is a metric for Natural Language Generation based on transfer learning.☆791Aug 4, 2023Updated 2 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- ☆10Sep 19, 2022Updated 3 years ago
- Code for LAMOL: LAnguage MOdeling for Lifelong Language Learning☆95Aug 28, 2020Updated 5 years ago
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)☆332Jan 10, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.☆1,154Feb 20, 2024Updated 2 years ago
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.☆466May 28, 2026Updated last month
- Beyond Accuracy: Behavioral Testing of NLP models with CheckList☆2,048Jan 9, 2024Updated 2 years ago
- [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining☆118Jul 25, 2023Updated 2 years ago
- A masked language modeling objective to train a model to predict any subset of the target words, conditioned on both the input text and a…☆247Sep 17, 2021Updated 4 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- Easily fine tune GPT-2 to fill in missing text☆203Dec 8, 2022Updated 3 years ago
- [EMNLP 2020] Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆398Nov 7, 2023Updated 2 years ago
- ☆49Jun 12, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆98Feb 9, 2023Updated 3 years ago
- Code and Data for ACL 2020 paper "Few-Shot NLG with Pre-Trained Language Model"☆189May 23, 2025Updated last year
- ☆76Mar 18, 2022Updated 4 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,929Feb 14, 2023Updated 3 years ago
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- Pronunciation-assisted Subword Modeling☆31May 30, 2019Updated 7 years ago
- Transformer based translation quality estimation☆114Jul 20, 2023Updated 2 years ago
- Automatically exported from code.google.com/p/transducersaurus☆11Apr 1, 2015Updated 11 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Jan 26, 2020Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- BERT-related papers☆2,036Aug 12, 2023Updated 2 years ago
- ☆18Feb 1, 2023Updated 3 years ago
- Adversarial Natural Language Inference Benchmark☆400May 12, 2022Updated 4 years ago
- jiant is an nlp toolkit☆1,676Jul 6, 2023Updated 2 years ago
- ☆1,294Dec 15, 2022Updated 3 years ago
- Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models …☆233Mar 24, 2023Updated 3 years ago
- Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagg…☆969May 21, 2024Updated 2 years ago