google-deepmind / pg19
☆225Updated 4 years ago
Related projects: ⓘ
- ☆311Updated 3 years ago
- Repository containing code for "How to Train BERT with an Academic Budget" paper☆309Updated last year
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"☆200Updated 3 years ago
- Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆456Updated last year
- A library for finding knowledge neurons in pretrained transformer models.☆145Updated 2 years ago
- An original implementation of "MetaICL Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi☆249Updated last year
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.☆177Updated last year
- Scalable training for dense retrieval models.☆268Updated last year
- The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper https//arxiv.org/abs/2212.01349)☆156Updated last year
- Neural Text Generation with Unlikelihood Training☆311Updated 3 years ago
- Understanding the Difficulty of Training Transformers☆325Updated 2 years ago
- Adversarial Natural Language Inference Benchmark☆388Updated 2 years ago
- The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models.☆168Updated 2 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆188Updated 3 years ago
- ☆114Updated 2 weeks ago
- This repository contains the FewGLUE dataset for few-shot natural language understanding.☆160Updated 4 years ago
- A framework for few-shot evaluation of autoregressive language models.☆98Updated last year
- ☆322Updated 5 months ago
- ☆158Updated last year
- A minimal PyTorch Lightning OpenAI GPT w DeepSpeed Training!☆111Updated last year
- Efficient, check-pointed data loading for deep learning with massive data sets.☆203Updated last year
- GeDi: Generative Discriminator Guided Sequence Generation☆208Updated last year
- Task-based datasets, preprocessing, and evaluation for sequence models.☆552Updated this week
- PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an…☆269Updated last year
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)☆324Updated 8 months ago
- Scripts and links to recreate the ELI5 dataset.☆316Updated 3 years ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆143Updated 2 months ago
- ☆67Updated 2 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆131Updated last year
- Used for adaptive human in the loop evaluation of language and embedding models.☆300Updated last year