Language Model based sentences scoring library
☆309 · Feb 9, 2022 · Updated 4 years ago
Alternatives and similar repositories for lm-scorer
Users interested in lm-scorer are comparing it to the libraries listed below.
- Python library & examples for Masked Language Model Scoring (ACL 2020) ☆348 · Dec 20, 2022 · Updated 3 years ago
- Pre-trained models and code and data to train and use models from "Pushing the Limits of Paraphrastic Sentence Embeddings with Millions o… ☆103 · Dec 5, 2023 · Updated 2 years ago
- Use BERT to Fill in the Blanks ☆84 · Jan 6, 2022 · Updated 4 years ago
- BLEURT is a metric for Natural Language Generation based on transfer learning. ☆786 · Aug 4, 2023 · Updated 2 years ago
- This repository contains materials for our tutorial on automatic grammatical error correction: R. Grundkiewicz, C. Bryant, M. Felice: A C… ☆38 · Dec 12, 2020 · Updated 5 years ago
- ☆13 · Mar 1, 2022 · Updated 4 years ago
- Uses gpt-2 to find all completions of a sentence over a certain probability threshold. ☆13 · Mar 17, 2020 · Updated 5 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT) ☆389 · Nov 7, 2023 · Updated 2 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models". ☆189 · Aug 17, 2021 · Updated 4 years ago
- Python port of Moses tokenizer, truecaser and normalizer ☆495 · Feb 6, 2026 · Updated 3 weeks ago
- Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks ☆12 · Feb 21, 2020 · Updated 6 years ago
- INSET: Sentence Infilling with Inter-sentential Transformer ☆30 · Nov 21, 2020 · Updated 5 years ago
- Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagg… ☆951 · May 21, 2024 · Updated last year
- BERT score for text generation ☆1,876 · Jul 30, 2024 · Updated last year
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering ☆175 · Jun 6, 2021 · Updated 4 years ago
- one script for xls-r/xlsr/whisper fine-tuning ☆42 · Jun 29, 2023 · Updated 2 years ago
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences. ☆458 · Mar 26, 2024 · Updated last year
- A tool for holistic analysis of language generations systems ☆471 · Sep 22, 2025 · Updated 5 months ago
- ☆120 · Sep 9, 2020 · Updated 5 years ago
- ☆13 · Aug 23, 2024 · Updated last year
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training ☆19 · Oct 12, 2024 · Updated last year
- Code for obtaining the Curation Corpus abstractive text summarisation dataset ☆128 · Nov 15, 2020 · Updated 5 years ago
- Neural Text Generation with Unlikelihood Training ☆310 · Aug 31, 2021 · Updated 4 years ago
- Language Modelling Makes Sense - WSD (and more) with Contextual Embeddings ☆97 · Jun 12, 2023 · Updated 2 years ago
- A Neural Language Style Transfer framework to transfer natural language text smoothly between fine-grained language styles like formal/ca… ☆493 · Dec 12, 2023 · Updated 2 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c… ☆359 · Feb 22, 2022 · Updated 4 years ago
- AIR retriever for Multi-Hop QA (ACL 2020 paper) ☆30 · Jul 18, 2020 · Updated 5 years ago
- NL-Augmenter: A Collaborative Repository of Natural Language Transformations ☆786 · May 19, 2024 · Updated last year
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning ☆48 · Nov 8, 2023 · Updated 2 years ago
- Natural Perturbation for Robust Question Answering ☆12 · Apr 7, 2020 · Updated 5 years ago
- ASR text preprocessing utility ☆21 · Aug 5, 2024 · Updated last year
- EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data" ☆345 · Nov 11, 2024 · Updated last year
- Boost inference speed of T5 models by 5x & reduce the model size by 3x. ☆589 · Apr 24, 2023 · Updated 2 years ago
- TextAttack is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs… ☆3,369 · Jul 10, 2025 · Updated 7 months ago
- Code for papers "A Surprisingly Robust Trick for Winograd Schema Challenge" and "WikiCREM: A Large Unsupervised Corpus for Coreference Re… ☆71 · Oct 4, 2022 · Updated 3 years ago
- The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to o… ☆378 · Apr 21, 2023 · Updated 2 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and … ☆317 · May 28, 2020 · Updated 5 years ago
- Prune a model while finetuning or training. ☆406 · Jun 21, 2022 · Updated 3 years ago
- A framework for detecting, highlighting and correcting grammatical errors on natural language text. Created by Prithiviraj Damodaran. Ope… ☆1,573 · Feb 15, 2023 · Updated 3 years ago