aimagelab / TransFusion
Official codebase of "Update Your Transformer to the Latest Release: Re-Basin of Task Vectors" - ICML 2025
☆21 Updated 6 months ago
Alternatives and similar repositories for TransFusion
Users who are interested in TransFusion are comparing it to the repositories listed below.
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models". ☆110 Updated 2 years ago
- ☆80 Updated 3 years ago
- Editing Models with Task Arithmetic ☆529 Updated 2 years ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024] ☆52 Updated last month
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024. ☆99 Updated last year
- A curated list of Model Merging methods. ☆96 Updated 2 months ago
- ☆208 Updated 2 years ago
- [ICML 2025] No Task Left Behind: Isotropic Model Merging with Common and Task-Specific Subspaces (official repository) ☆36 Updated 6 months ago
- Git Re-Basin: Merging Models modulo Permutation Symmetries in PyTorch ☆78 Updated 3 years ago
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2025. ☆659 Updated last week
- CKA (Centered Kernel Alignment) implemented in PyTorch ☆57 Updated last month
- Awesome coreset/core-set/subset/sample selection works. ☆182 Updated last year
- Code for coreset selection methods ☆254 Updated 2 years ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging ☆76 Updated 11 months ago
- LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters ☆45 Updated 6 months ago
- Fine-tuning Vision Transformers on various classification datasets ☆114 Updated last year
- ☆241 Updated last year
- Code accompanying the paper "Massive Activations in Large Language Models" ☆195 Updated last year
- Bayesian low-rank adaptation for large language models ☆28 Updated last year
- A simple PyTorch implementation of influence functions. ☆92 Updated last year
- Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time ☆505 Updated last year
- Bayesian Low-Rank Adaptation for Large Language Models ☆36 Updated last year
- A fast, effective data attribution method for neural networks in PyTorch ☆229 Updated last year
- Implementation of Beyond Neural Scaling beating power laws for deep models and prototype-based models ☆34 Updated 3 months ago
- Layer-wise Relevance Propagation for Large Language Models and Vision Transformers [ICML 2024] ☆219 Updated 6 months ago
- Repository for research works and resources related to model reprogramming <https://arxiv.org/abs/2202.10629> ☆64 Updated 4 months ago
- [ICLR 23] A new framework to transform any neural networks into an interpretable concept-bottleneck-model (CBM) without needing labeled c… ☆132 Updated last year
- [ECCV 2024] MagMax: Leveraging Model Merging for Seamless Continual Learning (official repository) ☆29 Updated last year
- Apply methods described in "Git Re-basin"-paper [1] to arbitrary models --- [1] Ainsworth et al. (https://arxiv.org/abs/2209.04836) ☆15 Updated 2 weeks ago
- A framework for merging models solving different tasks with different initializations into one multi-task model without any additional tr… ☆309 Updated 2 years ago