Task Singular Vectors: Reducing Task Interference in Model Merging. Merge models avoiding task interference through separable models.
☆49Dec 15, 2025Updated 2 months ago
Alternatives and similar repositories for task_singular_vectors
Users who are interested in task_singular_vectors are comparing it to the libraries listed below.
- Official codebase for AdaRank: Adaptive Rank Pruning for Enhanced Model Merging (ICLR 2026)☆16Jan 26, 2026Updated last month
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆52Dec 22, 2025Updated 2 months ago
- Official implementation of "Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent".☆21May 23, 2025Updated 9 months ago
- Representation Surgery for Multi-Task Model Merging. ICML, 2024.☆47Oct 10, 2024Updated last year
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆14Jun 26, 2025Updated 8 months ago
- The official repository of "Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors"☆49Oct 1, 2025Updated 5 months ago
- [ICML 2025] No Task Left Behind: Isotropic Model Merging with Common and Task-Specific Subspaces (official repository)☆38Aug 7, 2025Updated 6 months ago
- [NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging☆140Mar 17, 2025Updated 11 months ago
- Personal implementation of ASIF by Antonio Norelli☆26May 24, 2024Updated last year
- FireQ: Fast INT4-FP8 Kernel and RoPE-aware Quantization for LLM Inference Acceleration☆20Jun 27, 2025Updated 8 months ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆99Oct 28, 2024Updated last year
- ☆33Jul 8, 2024Updated last year
- [NeurIPS 2025] Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"☆30Oct 20, 2025Updated 4 months ago
- [ECCV 2024] MagMax: Leveraging Model Merging for Seamless Continual Learning (official repository)☆30Jul 29, 2024Updated last year
- Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"☆30Jun 7, 2024Updated last year
- Official code for our paper "Model Composition for Multimodal Large Language Models" (ACL 2024)☆31Jan 8, 2025Updated last year
- Flexible library for merging large language models (LLMs) via evolutionary optimization (ACL 2025 Demo).☆98Aug 8, 2025Updated 6 months ago
- [ICML 2024] Sparse Model Inversion: Efficient Inversion of Vision Transformers with Less Hallucination☆13Apr 29, 2025Updated 10 months ago
- Editing Models with Task Arithmetic☆534Jan 11, 2024Updated 2 years ago
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2026.☆674Feb 23, 2026Updated last week
- [NeurIPS 2024] For paper Parameter Competition Balancing for Model Merging☆48Oct 11, 2024Updated last year
- This repository is the official implementation of Topology-Informed Graph Transformer (Choi et al., GRaM Workshop at ICML 2024).☆12Dec 28, 2024Updated last year
- This repository shows how to implement a basic model for multimodal entailment.☆10Aug 17, 2021Updated 4 years ago
- [KDD Explore'24] Time Series Forecasting with LLMs: Understanding and Enhancing Model Capabilities☆17May 7, 2025Updated 9 months ago
- Workshop on Text Classification at 1729 Conference☆13Sep 4, 2022Updated 3 years ago
- [CVPR 2025 Highlight] FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation☆26Jun 16, 2025Updated 8 months ago
- ☆10Apr 24, 2024Updated last year
- ☆10Aug 15, 2022Updated 3 years ago
- [AAAI-25 Oral] Adaptive Calibration☆14Jul 6, 2025Updated 7 months ago
- Ideas on how to quickly learn to build command-line tools☆11Feb 26, 2022Updated 4 years ago
- Python 3 runtime libraries for ANTLR 4☆13Jun 7, 2015Updated 10 years ago
- PyTorch implementation for "Generative Modeling on Manifolds Through Mixture of Riemannian Diffusion Processes" (ICML 2024).☆13Jul 21, 2024Updated last year
- Official Implementation of Robustifying and Boosting Training-Free Neural Architecture Search☆10Mar 12, 2024Updated last year
- Simple TTF rasterizer☆11Mar 29, 2020Updated 5 years ago
- ☆10Jul 27, 2020Updated 5 years ago
- Surrogate Modeling of the Aerodynamic Performance for Transonic Regime☆13Feb 12, 2024Updated 2 years ago
- ☆40Jan 16, 2026Updated last month
- Code for ASE'24 paper "B4: Towards Optimal Assessment of Plausible Code Solutions with Plausible Tests"☆11Sep 10, 2024Updated last year
- Visual search interface☆11Nov 30, 2021Updated 4 years ago