rabeehk / compacterLinks

☆129

Alternatives and similar repositories for compacter

Users that are interested in compacter are comparing it to the libraries listed below

Sorting:

benzakenelad / BitFit
Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models
☆142Updated 2 years ago
rabeehk / hyperformer
☆157Updated 3 years ago
microsoft / AdaMix
This is the implementation of the paper AdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning (https://arxiv.org/abs/2205.1…
☆132Updated last year
AkariAsai / ATTEMPT
This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)
☆102Updated 2 years ago
arazd / ProgressivePrompts
Progressive Prompts: Continual Learning for Language Models
☆95Updated 2 years ago
bloomberg / dataless-model-merging
Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)
☆89Updated 2 years ago
kernelmachine / demix
DEMix Layers for Modular Language Modeling
☆53Updated 3 years ago
john-hewitt / backpacks-flash-attn
The original Backpack Language Model implementation, a fork of FlashAttention
☆69Updated 2 years ago
uds-lsv / MCSE
NAACL 2022: MCSE: Multimodal Contrastive Learning of Sentence Embeddings
☆55Updated last year
eric-mitchell / mend
MEND: Fast Model Editing at Scale
☆249Updated last year
NoviScl / GPT3-Reliability
☆78Updated 2 years ago
McGill-NLP / polytropon
☆54Updated 2 years ago
cambridgeltl / composable-sft
A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.
☆74Updated 11 months ago
BaohaoLiao / mefts
[NeurIPS 2023] Make Your Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning
☆31Updated 2 years ago
qcwthu / Lifelong-Fewshot-Language-Learning
The code for lifelong few-shot language learning
☆55Updated 3 years ago
llyx97 / TAMT
[NAACL 2022] "Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training", Yuanxin Liu, Fandong Meng, Zheng Lin, Pe…
☆15Updated 2 years ago
PiotrNawrot / dynamic-pooling
Efficient Transformers with Dynamic Token Pooling
☆62Updated 2 years ago
morningmoni / UniPELT
Code for paper "UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning", ACL 2022
☆62Updated 3 years ago
jzbjyb / ReAtt
Retrieval as Attention
☆83Updated 2 years ago
xlang-ai / icl-selective-annotation
[ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"
☆108Updated 2 years ago
HKUNLP / icl-ceil
[ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.
☆102Updated 2 years ago
fuzihaofzh / AnalyzeParameterEfficientFinetune
On the Effectiveness of Parameter-Efficient Fine-Tuning
☆38Updated last year
machelreid / diffuser
DiffusER: Discrete Diffusion via Edit-based Reconstruction (Reid, Hellendoorn & Neubig, 2022)
☆54Updated 2 years ago
joeljang / ELM
[ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning
☆99Updated 2 years ago
huawei-noah / Efficient-NLP
☆95Updated last year
ychen-stat-ml / kernel-adapters
Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning…
☆11Updated 2 years ago
XiangLi1999 / ContrastiveDecoding
contrastive decoding
☆203Updated 2 years ago
aviclu / ffn-values
☆62Updated 2 years ago
princeton-nlp / DinkyTrain
Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃
☆114Updated 2 years ago
Hunter-DDM / stablemoe
Code for the ACL-2022 paper "StableMoE: Stable Routing Strategy for Mixture of Experts"
☆47Updated 3 years ago