graldij / transformer-fusionLinks

Official repository of the "Transformer Fusion with Optimal Transport" paper, published as a conference paper at ICLR 2024.

☆30

Alternatives and similar repositories for transformer-fusion

Users that are interested in transformer-fusion are comparing it to the libraries listed below

Sorting:

nik-dim / tall_masks
Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]
☆51Updated last year
EnnengYang / AdaMerging
AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.
☆96Updated last year
PKU-ML / non_neg
Official Code for ICLR 2024 Paper: Non-negative Contrastive Learning
☆46Updated last year
arumaekawa / DiLM
Implementaiton of "DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation" (accepted by NAACL2024 Findings)".
☆24Updated 9 months ago
tanganke / peta
Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"
☆23Updated last year
ElvishElvis / LCA-on-the-line
LCA-on-the-line (ICML 2024 Oral)
☆13Updated 9 months ago
Wuyxin / DISC
(ICML 2023) Discover and Cure: Concept-aware Mitigation of Spurious Correlation
☆42Updated this week
ycjing / Awesome-Model-Merging
A curated list of Model Merging methods.
☆92Updated last year
IBM / model-reprogramming
Repository for research works and resources related to model reprogramming <https://arxiv.org/abs/2202.10629>
☆64Updated 2 months ago
anniesch / surgical-finetuning
Code for "Surgical Fine-Tuning Improves Adaptation to Distribution Shifts" published at ICLR 2023
☆29Updated 2 years ago
harveyhuang18 / EMR_Merging
[NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging
☆72Updated 8 months ago
AntoAndGar / task_singular_vectors
Task Singular Vectors: Reducing Task Interference in Model Merging. Merge models avoiding task interference through separable models.
☆36Updated 3 months ago
tanganke / opcm
official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"
☆21Updated last month
hqsiswiliam / hira
The repo for HiRA paper
☆33Updated 4 months ago
Jiachen-T-Wang / GREATS
☆18Updated 8 months ago
mpagli / Agree-to-Disagree
Repo for the paper: "Agree to Disagree: Diversity through Disagreement for Better Transferability"
☆36Updated 3 years ago
EnnengYang / RepresentationSurgery
Representation Surgery for Multi-Task Model Merging. ICML, 2024.
☆46Updated last year
UKPLab / iclr2024-model-merging
This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.
☆29Updated last year
uiuctml / Localize-and-Stitch
Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic
☆30Updated 2 months ago
pilancilab / Riemannian_Preconditioned_LoRA
source code for paper "Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models"
☆32Updated last year
mmatena / model_merging
☆79Updated 3 years ago
gortizji / tangent_task_arithmetic
Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".
☆107Updated 2 years ago
Shwai-He / SparseAdapter
Source code of EMNLP 2022 Findings paper "SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters"
☆19Updated last year
Nebularaid2000 / bottleneck
PyTorch implementation of the paper "Discovering and Explaining the Representation Bottleneck of DNNs" (ICLR 2022 Oral)
☆38Updated last year
TsinghuaC3I / SoRA
[EMNLP 2023, Main Conference] Sparse Low-rank Adaptation of Pre-trained Language Models
☆84Updated last year
OPTML-Group / DP4TL
[NeurIPS2023] "Selectivity Drives Productivity: Efficient Dataset Pruning for Enhanced Transfer Learning" by Yihua Zhang*, Yimeng Zhang*,…
☆14Updated 2 years ago
xu-ji / information-bottleneck
Deep Learning & Information Bottleneck
☆62Updated 2 years ago
ruthless-man / Awesome-Learn-from-Model
Awesome Learn From Model Beyond Fine-Tuning: A Survey
☆77Updated 11 months ago
DaShenZi721 / HRA
☆31Updated 5 months ago
MaximeRobeyns / bayesian_lora
Bayesian Low-Rank Adaptation for Large Language Models
☆36Updated last year