Python library & examples for Masked Language Model Scoring (ACL 2020)
☆350Dec 20, 2022Updated 3 years ago
Alternatives and similar repositories for mlm-scoring
Users that are interested in mlm-scoring are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Jun 10, 2021Updated 5 years ago
- The Benchmark of Linguistic Minimal Pairs☆167Dec 13, 2022Updated 3 years ago
- 📃Language Model based sentences scoring library☆311Updated this week
- BERT score for text generation☆1,901Jul 30, 2024Updated last year
- LAnguage Model Analysis☆1,390Jul 7, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- EMNLP 2021 - CTC: A Unified Framework for Evaluating Natural Language Generation☆97Mar 20, 2023Updated 3 years ago
- Code associated with the Don't Stop Pretraining ACL 2020 paper☆543Nov 15, 2021Updated 4 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆104Nov 26, 2022Updated 3 years ago
- BLEURT is a metric for Natural Language Generation based on transfer learning.☆792Aug 4, 2023Updated 2 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆358Feb 22, 2022Updated 4 years ago
- ☆10Sep 19, 2022Updated 3 years ago
- Code for LAMOL: LAnguage MOdeling for Lifelong Language Learning☆95Aug 28, 2020Updated 5 years ago
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)☆332Jan 10, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.☆1,153Feb 20, 2024Updated 2 years ago
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.☆467May 28, 2026Updated 2 weeks ago
- Beyond Accuracy: Behavioral Testing of NLP models with CheckList☆2,048Jan 9, 2024Updated 2 years ago
- [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining☆118Jul 25, 2023Updated 2 years ago
- A masked language modeling objective to train a model to predict any subset of the target words, conditioned on both the input text and a…☆246Sep 17, 2021Updated 4 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- Easily fine tune GPT-2 to fill in missing text☆202Dec 8, 2022Updated 3 years ago
- [EMNLP 2020] Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆397Nov 7, 2023Updated 2 years ago
- ☆49Jun 12, 2023Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆97Feb 9, 2023Updated 3 years ago
- Code and Data for ACL 2020 paper "Few-Shot NLG with Pre-Trained Language Model"☆190May 23, 2025Updated last year
- ☆76Mar 18, 2022Updated 4 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,929Feb 14, 2023Updated 3 years ago
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- Pronunciation-assisted Subword Modeling☆31May 30, 2019Updated 7 years ago
- Transformer based translation quality estimation☆114Jul 20, 2023Updated 2 years ago
- Automatically exported from code.google.com/p/transducersaurus☆11Apr 1, 2015Updated 11 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Jan 26, 2020Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- BERT-related papers☆2,036Aug 12, 2023Updated 2 years ago
- ☆18Feb 1, 2023Updated 3 years ago
- Adversarial Natural Language Inference Benchmark☆399May 12, 2022Updated 4 years ago
- jiant is an nlp toolkit☆1,675Jul 6, 2023Updated 2 years ago
- ☆1,294Dec 15, 2022Updated 3 years ago
- Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models …☆232Mar 24, 2023Updated 3 years ago
- Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagg…☆965May 21, 2024Updated 2 years ago