microsoft / AdaMixLinks

This is the implementation of the paper AdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning (https://arxiv.org/abs/2205.12410).

☆132

Alternatives and similar repositories for AdaMix

Users that are interested in AdaMix are comparing it to the libraries listed below

Sorting:

benzakenelad / BitFit
Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models
☆142Updated 2 years ago
john-hewitt / backpacks-flash-attn
The original Backpack Language Model implementation, a fork of FlashAttention
☆69Updated 2 years ago
rabeehk / compacter
☆129Updated 2 years ago
XiangLi1999 / ContrastiveDecoding
contrastive decoding
☆203Updated 2 years ago
huawei-noah / Efficient-NLP
☆95Updated last year
IBM / SALMON
Self-Alignment with Principle-Following Reward Models
☆162Updated 2 months ago
AkariAsai / ATTEMPT
This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)
☆102Updated 2 years ago
eric-mitchell / mend
MEND: Fast Model Editing at Scale
☆249Updated last year
bloomberg / dataless-model-merging
Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)
☆89Updated 2 years ago
McGill-NLP / length-generalization
Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023
☆136Updated last year
kernelmachine / silo-lm
SILO Language Models code repository
☆81Updated last year
HazyResearch / skill-it
Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models
☆46Updated last year
thu-coai / PICL
Code for ACL2023 paper: Pre-Training to Learn in Context
☆107Updated last year
seonghyeonye / TAPP
[AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following
☆79Updated 10 months ago
facebookresearch / MetaICL
An original implementation of "MetaICL Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi
☆270Updated 2 years ago
HKUNLP / icl-ceil
[ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.
☆102Updated 2 years ago
morningmoni / UniPELT
Code for paper "UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning", ACL 2022
☆62Updated 3 years ago
QingruZhang / PASTA
PASTA: Post-hoc Attention Steering for LLMs
☆122Updated 8 months ago
yizhongw / Tk-Instruct
Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.
☆181Updated 2 years ago
booydar / LM-RMT
Recurrent Memory Transformer
☆150Updated last year
swj0419 / in-context-pretraining
☆53Updated last year
SimiaoZuo / MoEBERT
This PyTorch package implements MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation (NAACL 2022).
☆109Updated 3 years ago
arazd / ProgressivePrompts
Progressive Prompts: Continual Learning for Language Models
☆95Updated 2 years ago
yxuansu / Contrastive_Search_Is_What_You_Need
[TMLR'23] Contrastive Search Is What You Need For Neural Text Generation
☆119Updated 2 years ago
facebookresearch / NPM
The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper https//arxiv.org/abs/2212.01349)
☆157Updated 2 years ago
mkshing / Prompt-Tuning
Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"
☆167Updated 3 years ago
facebookresearch / RLCD
Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment
☆69Updated last year
joeljang / ELM
[ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning
☆99Updated 2 years ago
thunlp / Prompt-Transferability
On Transferability of Prompt Tuning for Natural Language Processing
☆99Updated last year
mlfoundations / scaling
Language models scale reliably with over-training and on downstream tasks
☆97Updated last year