timoschick / dino
This repository contains the code for "Generating Datasets with Pretrained Language Models".
☆188Updated 3 years ago
Alternatives and similar repositories for dino:
Users that are interested in dino are comparing it to the libraries listed below
- [EMNLP 2021] Improving and Simplifying Pattern Exploiting Training☆154Updated 2 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆154Updated last year
- ☆97Updated 2 years ago
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"☆202Updated 3 years ago
- ☆75Updated 3 years ago
- A multilingual version of MS MARCO passage ranking dataset☆144Updated last year
- ☆97Updated 2 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆136Updated last year
- Collection of NLP model explanations and accompanying analysis tools☆145Updated last year
- Language model Prompt And Query Archive☆158Updated 3 years ago
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance☆204Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆104Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆143Updated 2 years ago
- A repo to explore different NLP tasks which can be solved using T5☆172Updated 4 years ago
- ☆182Updated last year
- A BART version of an open-domain QA model in a closed-book setup☆119Updated 4 years ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆202Updated 2 years ago
- Hyperparameter Search for AllenNLP☆139Updated last month
- ☆92Updated 3 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆119Updated 3 years ago
- A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations☆55Updated 2 years ago
- GeDi: Generative Discriminator Guided Sequence Generation☆209Updated 2 years ago
- ☆65Updated last year
- State of the art Semantic Sentence Embeddings☆99Updated 2 years ago
- Efficient Attention for Long Sequence Processing☆93Updated last year
- ☆77Updated 11 months ago
- The autoregressive information extraction system GenIE (Generative Information Extraction) implemented in PyTorch.☆102Updated 2 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks☆195Updated 2 years ago
- Pipeline for pulling and processing online language model pretraining data from the web☆177Updated last year