theblackcat102 / unify-learning-paradigms
data collator for UL2 and U-PaLM
☆29 Updated last year
Alternatives and similar repositories for unify-learning-paradigms
Users who are interested in unify-learning-paradigms are comparing it to the repositories listed below
- Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention) ☆62 Updated 3 years ago
- ☆22 Updated 2 years ago
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation ☆119 Updated 2 years ago
- ☆96 Updated 2 years ago
- Embedding Recycling for Language models ☆38 Updated last year
- ☆34 Updated 2 years ago
- Automatic metrics for GEM tasks ☆66 Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models. ☆104 Updated 2 years ago
- Few-shot Learning with Auxiliary Data ☆28 Updated last year
- Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃 ☆114 Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset. ☆93 Updated 2 years ago
- The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences". ☆70 Updated last year
- ☆24 Updated 2 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx… ☆136 Updated last year
- ☆48 Updated last year
- ☆66 Updated last year
- Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023) ☆22 Updated last year
- ☆72 Updated 2 years ago
- ☆97 Updated 2 years ago
- ☆100 Updated 2 years ago
- Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023 ☆136 Updated last year
- A fast implementation of T5/UL2 in PyTorch using Flash Attention ☆105 Updated 3 months ago
- Implementation of Marge, Pre-training via Paraphrasing, in Pytorch ☆76 Updated 4 years ago
- ☆159 Updated 2 years ago
- Code for the paper "Closing the Curious Case of Neural Text Degeneration" ☆10 Updated 2 months ago
- Experiments with generating opensource language model assistants ☆97 Updated 2 years ago
- DEMix Layers for Modular Language Modeling ☆53 Updated 3 years ago
- ☆68 Updated 2 years ago
- ☆19 Updated last year
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI ☆94 Updated 2 years ago