graldij / transformer-fusion
Official repository of the "Transformer Fusion with Optimal Transport" paper, published as a conference paper at ICLR 2024.
☆28Updated 10 months ago
Alternatives and similar repositories for transformer-fusion:
Users that are interested in transformer-fusion are comparing it to the libraries listed below
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"☆20Updated 6 months ago
- Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic☆23Updated 2 months ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆41Updated 4 months ago
- Representation Surgery for Multi-Task Model Merging. ICML, 2024.☆39Updated 5 months ago
- [EMNLP 2023 Main] Sparse Low-rank Adaptation of Pre-trained Language Models☆72Updated last year
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆49Updated last week
- [NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging☆51Updated 3 months ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆97Updated last year
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆67Updated 4 months ago
- A curated list of Model Merging methods.☆90Updated 5 months ago
- source code of (quasi-)Givens Orthogonal Fine Tuning integrated to peft lib☆14Updated 5 months ago
- [NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"☆60Updated last year
- ☆13Updated 10 months ago
- This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.☆27Updated 9 months ago
- LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters☆30Updated last week
- Official repo for NeurIPS'24 paper "WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models"☆12Updated 2 months ago
- Data distillation benchmark☆57Updated 3 weeks ago
- ☆65Updated 2 years ago
- Repo for the paper: "Agree to Disagree: Diversity through Disagreement for Better Transferability"☆35Updated 2 years ago
- This is the official code for the paper "Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning" (NeurIPS2024)☆17Updated 6 months ago
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"☆34Updated last month
- Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation (ICML'24 Oral)☆14Updated 7 months ago
- Implementaiton of "DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation" (accepted by NAACL2024 Findings)".☆17Updated last month
- Discover and Cure: Concept-aware Mitigation of Spurious Correlation (ICML 2023)☆41Updated 11 months ago
- Repository for research works and resources related to model reprogramming <https://arxiv.org/abs/2202.10629>☆60Updated 11 months ago
- ☆19Updated 9 months ago
- [NeurIPS 2024] BLoB: Bayesian Low-Rank Adaptation by Backpropagation for Large Language Models☆24Updated last month