A curated list of Model Merging methods.
☆95Dec 3, 2025Updated 3 months ago
Alternatives and similar repositories for Awesome-Model-Merging
Users that are interested in Awesome-Model-Merging are comparing it to the libraries listed below
Sorting:
- Code for CVPR 2024 Oral "Neural Lineage"☆17Jun 18, 2024Updated last year
- Code for ECCV 2022 paper “Learning with Recoverable Forgetting”☆21Jul 27, 2022Updated 3 years ago
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2026.☆680Updated this week
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.☆34Mar 11, 2025Updated 11 months ago
- (CVPR 2024) "Unsegment Anything by Simulating Deformation"☆29May 27, 2024Updated last year
- Code for "Merging Text Transformers from Different Initializations"☆20Feb 2, 2025Updated last year
- Towards Meta-Pruning via Optimal Transport, ICLR 2024 (Spotlight)☆18Dec 5, 2024Updated last year
- This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).☆33Mar 5, 2024Updated 2 years ago
- [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆65Sep 27, 2025Updated 5 months ago
- PyTorch implementation of AmalgamateGNN (CVPR'21)☆21Jul 29, 2022Updated 3 years ago
- Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)☆92Jul 25, 2023Updated 2 years ago
- A framework for merging models solving different tasks with different initializations into one multi-task model without any additional tr…☆310Jan 18, 2024Updated 2 years ago
- ☆210Feb 3, 2024Updated 2 years ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆34Jan 16, 2026Updated last month
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆99Oct 28, 2024Updated last year
- Dimple, the first Discrete Diffusion Multimodal Large Language Model☆115Jul 9, 2025Updated 7 months ago
- ☆19Feb 15, 2023Updated 3 years ago
- [ECCV2022] Factorizing Knowledge in Neural Networks☆91Sep 12, 2022Updated 3 years ago
- ☆18Nov 8, 2023Updated 2 years ago
- Code release for REPAIR: REnormalizing Permuted Activations for Interpolation Repair☆52Jan 29, 2024Updated 2 years ago
- [ICCV‘25] Official implementation of paper "Towards Performance Consistency in Multi-Level Model Collaboration"☆43Oct 23, 2025Updated 4 months ago
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion☆205Feb 6, 2026Updated 3 weeks ago
- ☆58Oct 6, 2023Updated 2 years ago
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching☆117Jul 15, 2024Updated last year
- [ICML 2025] No Task Left Behind: Isotropic Model Merging with Common and Task-Specific Subspaces (official repository)☆38Aug 7, 2025Updated 6 months ago
- [Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control☆46Nov 25, 2025Updated 3 months ago
- ☆23Nov 1, 2022Updated 3 years ago
- ☆42Sep 5, 2023Updated 2 years ago
- (ICLR 2025 spotlight) "Poison-splat: Computation Cost Attack on 3D Gaussian Splatting"☆76Feb 13, 2025Updated last year
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆44Nov 8, 2024Updated last year
- An interactive demo based on Segment-Anything for style transfer which enables different content regions apply different styles.☆101Apr 24, 2023Updated 2 years ago
- Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"☆14Mar 28, 2024Updated last year
- [ICML 2025] Official PyTorch implementation of paper "Ultra-Resolution Adaptation with Ease".☆117May 3, 2025Updated 10 months ago
- [ECCV 2024] Isomorphic Pruning for Vision Models☆81Jul 23, 2024Updated last year
- The first Object-Oriented Programming (OOP) Evaluation Benchmark for LLMs☆27Jan 15, 2025Updated last year
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆52Dec 22, 2025Updated 2 months ago
- Source code for the paper "Do Deep Neural Network Solutions form a Star Domain?"☆12May 26, 2024Updated last year
- [ICML 2024] DPZero: Private Fine-Tuning of Language Models without Backpropagation☆16Sep 4, 2024Updated last year
- ☆12Feb 11, 2026Updated 3 weeks ago