ycjing/Awesome-Model-Merging

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ycjing/Awesome-Model-Merging)

ycjing / Awesome-Model-Merging

A curated list of Model Merging methods.

☆95

Alternatives and similar repositories for Awesome-Model-Merging

Users that are interested in Awesome-Model-Merging are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Jiang-Yidi / TransformerDistillation-SLU
View on GitHub
☆13Nov 25, 2021Updated 4 years ago
Yuanshi9815 / LiteFocus
View on GitHub
[Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.
☆34Mar 11, 2025Updated last year
JngwenYe / LIRF
View on GitHub
Code for ECCV 2022 paper “Learning with Recoverable Forgetting”
☆21Jul 27, 2022Updated 3 years ago
EnnengYang / Awesome-Model-Merging-Methods-Theories-Applications
View on GitHub
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2026.
☆769Jul 17, 2026Updated last week
Adamdad / vico
View on GitHub
Vico: Compositional Video Generation as Flow Equalization
☆59Nov 15, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
nverma1 / merging-text-transformers
View on GitHub
Code for "Merging Text Transformers from Different Initializations"
☆20Feb 2, 2025Updated last year
jiahaolu97 / anything-unsegmentable
View on GitHub
(CVPR 2024) "Unsegment Anything by Simulating Deformation"
☆29May 27, 2024Updated 2 years ago
gstoica27 / ZipIt
View on GitHub
A framework for merging models solving different tasks with different initializations into one multi-task model without any additional tr…
☆316Jan 18, 2024Updated 2 years ago
Carol-lyh / GateControl
View on GitHub
☆22Apr 3, 2026Updated 3 months ago
VainF / Reasoning-SFT
View on GitHub
SFT of Reasoning LLMs with Megatron-LM
☆23Jun 19, 2025Updated last year
ycjing / AmalgamateGNN.PyTorch
View on GitHub
PyTorch implementation of AmalgamateGNN (CVPR'21)
☆21Jul 29, 2022Updated 3 years ago
czg1225 / VeriThinker
View on GitHub
[NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient
☆67Sep 27, 2025Updated 9 months ago
Huage001 / StyDeSty
View on GitHub
PyTorch implementation of paper "StyDeSty: Min-Max Stylization and Destylization for Single Domain Generalization" in ICML 2024.
☆16Jun 4, 2024Updated 2 years ago
bloomberg / dataless-model-merging
View on GitHub
Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)
☆92Jul 25, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
YinBo0927 / RePro
View on GitHub
The official code of Refinement Provenance Inference: Detecting LLM-Refined Training Prompts from Model Behavior
☆22Jan 6, 2026Updated 6 months ago
prateeky2806 / ties-merging
View on GitHub
☆216Feb 3, 2024Updated 2 years ago
LiQiiiii / Neural-Ligand
View on GitHub
[ICCV‘25] Official implementation of paper "Towards Performance Consistency in Multi-Level Model Collaboration"
☆45Oct 23, 2025Updated 9 months ago
yu-rp / Dimple
View on GitHub
Dimple, the first Discrete Diffusion Multimodal Large Language Model
☆117Jul 9, 2025Updated last year
florinshen / PlaneDreamer
View on GitHub
DreamGaussian with 2D-GS
☆12Oct 10, 2024Updated last year
EnnengYang / AdaMerging
View on GitHub
AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.
☆113Oct 28, 2024Updated last year
zju-vipa / training_free_model_merging
View on GitHub
This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).
☆34Mar 5, 2024Updated 2 years ago
Adamdad / Samesame
View on GitHub
An Tensorflow.keras implementation of Same, Same But Different - Recovering Neural Network Quantization Error Through Weight Factorizatio…
☆10Dec 18, 2019Updated 6 years ago
OliverRensu / GRAT
View on GitHub
This repository includes the official implementation of our paper "Grouping First, Attending Smartly: Training-Free Acceleration for Diff…
☆56May 21, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
JngwenYe / PNCloning
View on GitHub
an official PyTorch implementation of the paper "Partial Network Cloning", CVPR 2023
☆13Mar 21, 2023Updated 3 years ago
luli-git / MAP
View on GitHub
MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation
☆18Sep 2, 2024Updated last year
horseee / learning-to-cache
View on GitHub
[NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching
☆122Jul 15, 2024Updated 2 years ago
rahimentezari / PermutationInvariance
View on GitHub
☆23Nov 1, 2022Updated 3 years ago
fagp / sinkhorn-rebasin
View on GitHub
☆18Nov 8, 2023Updated 2 years ago
yule-BUAA / MergeLM
View on GitHub
Codebase for Merging Language Models (ICML 2024)
☆870May 5, 2024Updated 2 years ago
stanislavfort / dissect-git-re-basin
View on GitHub
Replicating and dissecting the git-re-basin project in one-click-replication Colabs
☆37Sep 19, 2022Updated 3 years ago
VainF / MaskLLM-4V
View on GitHub
Learnable Semi-structured Sparsity for Vision Transformers and Diffusion Transformers
☆15Feb 7, 2025Updated last year
Adamdad / Repfusion
View on GitHub
☆60Oct 6, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
KellerJordan / REPAIR
View on GitHub
Code release for REPAIR: REnormalizing Permuted Activations for Interpolation Repair
☆53Jan 29, 2024Updated 2 years ago
Huage001 / URAE
View on GitHub
[ICML 2025] Official PyTorch implementation of paper "Ultra-Resolution Adaptation with Ease".
☆118May 3, 2025Updated last year
Huage001 / DatasetFactorization
View on GitHub
PyTorch implementation of paper "Dataset Distillation via Factorization" in NeurIPS 2022.
☆67Nov 28, 2022Updated 3 years ago
aktsonthalia / starlight
View on GitHub
Source code for the paper "Do Deep Neural Network Solutions form a Star Domain?"
☆12May 26, 2024Updated 2 years ago
jiahaolu97 / poison-splat
View on GitHub
(ICLR 2025 spotlight) "Poison-splat: Computation Cost Attack on 3D Gaussian Splatting"
☆78Feb 13, 2025Updated last year
horseee / dKV-Cache
View on GitHub
[NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models
☆135May 22, 2025Updated last year
Lexie-YU / ViFeEdit
View on GitHub
[Preprint] ViFeEdit: A Video-Free Tuner of Your Video Diffusion Transformer
☆67Mar 31, 2026Updated 3 months ago