timoschick / dino
This repository contains the code for "Generating Datasets with Pretrained Language Models".
☆188Updated 3 years ago
Alternatives and similar repositories for dino:
Users that are interested in dino are comparing it to the libraries listed below
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"☆202Updated 3 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆153Updated last year
- [EMNLP 2021] Improving and Simplifying Pattern Exploiting Training☆154Updated 2 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆141Updated 2 years ago
- ☆75Updated 3 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆135Updated last year
- Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper☆292Updated last year
- ☆97Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago
- A multilingual version of MS MARCO passage ranking dataset☆143Updated last year
- ☆188Updated 3 years ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆201Updated 2 years ago
- Search Engines with Autoregressive Language models☆283Updated last year
- Collection of NLP model explanations and accompanying analysis tools☆145Updated last year
- A BART version of an open-domain QA model in a closed-book setup☆119Updated 4 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks☆195Updated 2 years ago
- GeDi: Generative Discriminator Guided Sequence Generation☆208Updated 2 years ago
- We introduce MKQA, an open-domain question answering evaluation set comprising 10k question-answer pairs aligned across 26 typologically …☆172Updated 2 years ago
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance☆205Updated last year
- Efficient Attention for Long Sequence Processing☆92Updated last year
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆119Updated 3 years ago
- ☆182Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆103Updated last year
- Repository containing code for "How to Train BERT with an Academic Budget" paper☆312Updated last year
- ☆97Updated 2 years ago
- Language model Prompt And Query Archive☆158Updated 3 years ago
- An original implementation of EMNLP 2020, "AmbigQA: Answering Ambiguous Open-domain Questions"☆118Updated 2 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆136Updated last year
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning☆101Updated 4 years ago
- FactSumm: Factual Consistency Scorer for Abstractive Summarization☆110Updated last year