zju-vipa/training_free_model_merging


zju-vipa / training_free_model_merging

This repository is the implementation of the paper "Training-Free Pretrained Model Merging" (CVPR 2024).

☆34

Alternatives and similar repositories for training_free_model_merging

Users that are interested in training_free_model_merging are comparing it to the libraries listed below.


AIR-DISCOVER / Model-Merging-MTDA
[ECCV24] The official code repository for paper "Training-Free Model Merging for Multi-target Domain Adaptation".
☆18 · Sep 27, 2024 · Updated last year
prateeky2806 / ties-merging
☆217 · Feb 3, 2024 · Updated 2 years ago
yule-BUAA / MergeLLM
Codes for Merging Large Language Models
☆37 · Aug 7, 2024 · Updated last year
nverma1 / merging-text-transformers
Code for "Merging Text Transformers from Different Initializations"
☆20 · Feb 2, 2025 · Updated last year
david3684 / AdaRank
Official codebase for AdaRank: Adaptive Rank Pruning for Enhanced Model Merging (ICLR 2026)
☆20 · Jan 26, 2026 · Updated 6 months ago
ZIB-IOL / SMS
Code to reproduce the experiments of the ICLR24-paper: "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging"
☆12 · Oct 14, 2025 · Updated 9 months ago
yule-BUAA / MergeLM
Codebase for Merging Language Models (ICML 2024)
☆870 · May 5, 2024 · Updated 2 years ago
mjy1111 / PEAK
The repository for our paper: Neighboring Perturbations of Knowledge Editing on Large Language Models
☆16 · May 4, 2024 · Updated 2 years ago
WalkerWorldPeace / DOGE
Official implementation of "Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent".
☆23 · May 23, 2025 · Updated last year
MYusha / Video-Anomaly-Detection
classifier two-sample test for video anomaly detections
☆11 · Jul 3, 2019 · Updated 7 years ago
nabk89 / NAS-with-Proxy-data
Official code of "NAS acceleration via proxy data", IJCAI21
☆10 · May 29, 2022 · Updated 4 years ago
patrickpynadath1 / candi-diffusion
CANDI: Continuous and Discrete Diffusion
☆28 · Oct 27, 2025 · Updated 9 months ago
kietngt00 / UFC
[NeurIPS 2025] Universal Few-Shot Spatial Control for Diffusion Models
☆21 · Sep 18, 2025 · Updated 10 months ago
peijunallin / alphalora
☆19 · Nov 10, 2024 · Updated last year
gstoica27 / ZipIt
A framework for merging models solving different tasks with different initializations into one multi-task model without any additional tr…
☆316 · Jan 18, 2024 · Updated 2 years ago
zheng-ningxin / brp-nas
☆16 · Mar 9, 2021 · Updated 5 years ago
duterscmy / CD-MoE
Official PyTorch implementation of CD-MOE
☆12 · Mar 18, 2026 · Updated 4 months ago
tanganke / peta
Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"
☆26 · Sep 13, 2024 · Updated last year
Ernie1 / Pi-NAS
Pi-NAS: Improving Neural Architecture Search by Reducing Supernet Training Consistency Shift (ICCV 2021)
☆20 · Nov 28, 2021 · Updated 4 years ago
MengLcool / SEGIC
[ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".
☆27 · Oct 13, 2024 · Updated last year
bloomberg / dataless-model-merging
Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)
☆92 · Jul 25, 2023 · Updated 3 years ago
arcee-ai / PruneMe
Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
☆267 · Apr 23, 2024 · Updated 2 years ago
zarakiquemparte / zaraki-tools
☆28 · Aug 30, 2023 · Updated 2 years ago
imagination-research / LCSC
[ICLR 2025] Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better
☆16 · Feb 15, 2025 · Updated last year
xieydd / Pytorch-Single-Path-One-Shot-NAS
Single Path One-Shot NAS MXNet implementation with Supernet training and searching
☆19 · Dec 23, 2019 · Updated 6 years ago
csguoh / IntLoRA
[ICML2025] LoRA fine-tune directly on the INT4 models.
☆41 · Nov 25, 2024 · Updated last year
54rt1n / shardmerge
Using Fourier interpolation to merge large language models
☆11 · Jul 11, 2026 · Updated 2 weeks ago
lzy7976 / union-set-model-adaptation
Union-set Multi-source Model Adaptation for Semantic Segmentation
☆12 · Oct 24, 2022 · Updated 3 years ago
hahnyuan / ASVD4LLM
Activation-aware Singular Value Decomposition for Compressing Large Language Models
☆92 · Oct 22, 2024 · Updated last year
samsja / pydantic_config
Manage ML configuration with pydantic
☆16 · Mar 18, 2026 · Updated 4 months ago
symanto-research / merge-tokenizers
Package to align tokens from different tokenizations.
☆16 · Mar 25, 2024 · Updated 2 years ago
gatech-sysml / CompOFA
[ICLR 2021] CompOFA: Compound Once-For-All Networks For Faster Multi-Platform Deployment
☆25 · Jan 5, 2023 · Updated 3 years ago
Egg-Hu / LoRA-Recycle
[CVPR 2025] LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs
☆14 · Jun 20, 2025 · Updated last year
Sharpiless / Awesome-datafree-KD
A collection of papers on zero-shot/data-free knowledge distillation from 2019 to 2021
☆11 · Sep 8, 2021 · Updated 4 years ago
yunfanLu / Self-EvRSVFI
[IEEE TVCG 2025] Self-supervised Learning of Event-guided Video Frame Interpolation for Rolling Shutter Frames
☆11 · Jun 1, 2025 · Updated last year
KellerJordan / REPAIR
Code release for REPAIR: REnormalizing Permuted Activations for Interpolation Repair
☆53 · Jan 29, 2024 · Updated 2 years ago
MediaBrain-SJTU / LoRKD
☆25 · Nov 8, 2024 · Updated last year
tihbe / python-ebdataset
An event based dataset loader under one common python API.
☆10 · Mar 22, 2022 · Updated 4 years ago
EnVision-Research / TASC
☆27 · Apr 28, 2025 · Updated last year