☆156Aug 24, 2021Updated 4 years ago
Alternatives and similar repositories for hyperformer
Users that are interested in hyperformer are comparing it to the libraries listed below
Sorting:
- ☆130Aug 18, 2022Updated 3 years ago
- Zero-shot Learning by Generating Task-specific Adapters☆14Apr 2, 2021Updated 4 years ago
- This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)☆104Dec 1, 2022Updated 3 years ago
- Learning adapter weights from task descriptions☆19Nov 12, 2023Updated 2 years ago
- ☆54May 8, 2023Updated 2 years ago
- Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304☆14Oct 11, 2022Updated 3 years ago
- Continual Learning with Hypernetworks. A continual learning approach that has the flexibility to learn a dedicated set of parameters, fin…☆170Apr 11, 2022Updated 3 years ago
- Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning" (ICLR 2022)☆543Mar 24, 2022Updated 3 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆98Apr 26, 2023Updated 2 years ago
- Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data☆57Aug 5, 2021Updated 4 years ago
- An original implementation of "MetaICL Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi☆273Apr 15, 2023Updated 2 years ago
- Official PyTorch implementation of CD-MOE☆12Mar 29, 2025Updated 11 months ago
- ☆11Nov 11, 2022Updated 3 years ago
- Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning…☆11Feb 6, 2023Updated 3 years ago
- Unofficially Implements https://arxiv.org/abs/2112.05682 to get Linear Memory Cost on Attention for PyTorch☆12Jan 16, 2022Updated 4 years ago
- Adding new tasks to T0 without catastrophic forgetting☆33Oct 20, 2022Updated 3 years ago
- PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)☆209Dec 18, 2022Updated 3 years ago
- This package implements THOR: Transformer with Stochastic Experts.☆65Oct 7, 2021Updated 4 years ago
- A Unified Library for Parameter-Efficient and Modular Transfer Learning☆2,801Oct 12, 2025Updated 4 months ago
- Meta Representation Transformation for Low-resource Cross-lingual Learning☆41May 5, 2021Updated 4 years ago
- Code for paper "UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning", ACL 2022☆63Mar 23, 2022Updated 3 years ago
- Code accompanying the paper Pretraining Language Models with Human Preferences☆180Feb 13, 2024Updated 2 years ago
- On Transferability of Prompt Tuning for Natural Language Processing☆101May 3, 2024Updated last year
- VisBERT: Demo web app for "How Does BERT Answer Questions?"☆11Jul 22, 2023Updated 2 years ago
- ☆12Dec 6, 2024Updated last year
- PyTorch codes for "LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning"☆242Jan 20, 2023Updated 3 years ago
- ☆27Feb 26, 2023Updated 3 years ago
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆14Oct 27, 2021Updated 4 years ago
- This repo contains the code for Late Prompt Tuning.☆12Dec 22, 2025Updated 2 months ago
- X2Static embeddings☆15Jul 15, 2021Updated 4 years ago
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"☆456Sep 6, 2023Updated 2 years ago
- ☆143Jul 21, 2024Updated last year
- The code for lifelong few-shot language learning☆55Feb 17, 2022Updated 4 years ago
- Building modular LMs with parameter-efficient fine-tuning.☆114Jan 18, 2026Updated last month
- An original implementation of "Noisy Channel Language Model Prompting for Few-Shot Text Classification"☆131Apr 23, 2022Updated 3 years ago
- Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)☆102Nov 2, 2020Updated 5 years ago
- The Intermediate Goal of the project is to train a GPT like architecture to learn to summarise reddit posts from human preferences, as th…☆12Jul 14, 2021Updated 4 years ago
- sigma-MoE layer☆21Jan 5, 2024Updated 2 years ago
- [NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding☆70Sep 18, 2022Updated 3 years ago