WalkerWorldPeace/MLLMerging

ICLR 2026 "OptMerge: Unifying Multimodal LLM Capabilities and Modalities via Model Merging".

☆57

Alternatives and similar repositories for MLLMerging

Users that are interested in MLLMerging are comparing it to the libraries listed below.

WalkerWorldPeace / DOGE
Official implementation of "Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent".
☆23 · May 23, 2025 · Updated last year
nathanielyvo / WUDI-Merging
The official repository of "Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors"
☆50 · Oct 1, 2025 · Updated 9 months ago
uiuctml / MergeBench
[NeurIPS 2025] MergeBench: A Benchmark for Merging Domain-Specialized LLMs
☆47 · Feb 11, 2026 · Updated 5 months ago
apanariello4 / core-space-merging
PyTorch code for NeurIPS 2025 paper "Accurate and Efficient Low-Rank Model Merging in Core Space"
☆41 · Feb 2, 2026 · Updated 5 months ago
AuroraZengfh / RobustMerge
[NeurIPS'25 Spotlight🔥] Official Implementation of RobustMerge: Parameter-Efficient Model Merging for MLLMs with Direction Robustness
☆67 · Jun 24, 2026 · Updated 3 weeks ago
tanganke / fusion_bench
FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion
☆235 · Jun 23, 2026 · Updated 3 weeks ago
nonwhy / PURE
[ICCV2025] PyTorch implementation of "Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal …
☆125 · Jan 24, 2026 · Updated 5 months ago
shiqichen17 / VLM_Merging
GitHub repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025)
☆89 · Jun 9, 2026 · Updated last month
sdc17 / CrossGET
[ICML 2024] CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers
☆34 · Dec 30, 2024 · Updated last year
tanganke / opcm
Official code repo for the paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"
☆25 · Oct 11, 2025 · Updated 9 months ago
Egg-Hu / Awesome-Synthetic-Data-Generation
☆19 · Jan 7, 2026 · Updated 6 months ago
hahahawu / Long-to-Short-via-Model-Merging
Model merging is a highly efficient approach for long-to-short reasoning.
☆103 · Oct 15, 2025 · Updated 9 months ago
Zhengsh123 / FREE-Merging
The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)
☆16 · Jun 26, 2025 · Updated last year
gstoica27 / KnOTS
Model Merging with SVD to Tie the KnOTS [ICLR 2025]
☆94 · Apr 3, 2025 · Updated last year
Egg-Hu / LoRA-Recycle
[CVPR 2025] LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs
☆14 · Jun 20, 2025 · Updated last year
THUNLP-MT / ModelCompose
Official code for our paper "Model Composition for Multimodal Large Language Models" (ACL 2024)
☆31 · Jan 8, 2025 · Updated last year
ZIB-IOL / SMS
Code to reproduce the experiments of the ICLR 2024 paper "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging"
☆12 · Oct 14, 2025 · Updated 9 months ago
tianyu139 / tangent-model-composition
Code for Tangent Model Composition for Ensembling and Continual Fine-tuning (ICCV 2023) and Tangent Transformers for Composition, Privacy…
☆14 · May 14, 2024 · Updated 2 years ago
allenai / DrawEduMath
Can VLMs understand students' hand-drawn math work?
☆19 · Jan 20, 2026 · Updated 6 months ago
JinXins / MergeMix
[ICLR 2026] MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding
☆21 · Feb 27, 2026 · Updated 4 months ago
ExplainableML / fomo_in_flux
Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]
☆62 · Dec 10, 2024 · Updated last year
zjunlp / ModelKinship
Exploring Model Kinship for Merging Large Language Models
☆28 · Apr 16, 2025 · Updated last year
kyrie-23 / linear_task_arithmetic
☆12 · Jul 30, 2025 · Updated 11 months ago
MergeVLA / MergeVLA
[CVPR 2026] MergeVLA: Cross-Skill Model Merging Toward a Generalist Vision-Language-Action Agent
☆36 · Apr 30, 2026 · Updated 2 months ago
pixeli99 / MixLN
[ICLR 2025] Official PyTorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…
☆30 · Jul 24, 2025 · Updated 11 months ago
skgyu / SpaceshipNet
Code for "Data-Free Knowledge Distillation via Feature Exchange and Activation Region Constraint"
☆21 · Oct 23, 2023 · Updated 2 years ago
prateeky2806 / ties-merging
☆215 · Feb 3, 2024 · Updated 2 years ago
Birch-san / imagebind-guided-diffusion
Guide diffusion on ImageBind embedding similarity
☆29 · May 27, 2023 · Updated 3 years ago
zjucsq / PLA
[ICLR2023] Video Scene Graph Generation from Single-Frame Weak Supervision
☆12 · Sep 17, 2023 · Updated 2 years ago
nik-dim / tall_masks
Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]
☆53 · Dec 22, 2025 · Updated 6 months ago
yule-BUAA / MergeLLM
Code for Merging Large Language Models
☆37 · Aug 7, 2024 · Updated last year
danielm1405 / magmax
[ECCV 2024] MagMax: Leveraging Model Merging for Seamless Continual Learning (official repository)
☆32 · Jul 29, 2024 · Updated last year
Kurt232 / RLKV
☆35 · Jun 8, 2026 · Updated last month
mlfoundations / task_vectors
Editing Models with Task Arithmetic
☆548 · Jan 11, 2024 · Updated 2 years ago
Z1zs / MMUnlearner
Official implementation of the ACL'25 Findings paper "MMUnlearner: Reformulating Multimodal Machine Unlearning in the Era of Multimodal Large Lang…
☆26 · Jun 17, 2025 · Updated last year
TimeMarker-LLM / UniComp
[CVPR 2026] Official repository for "UniComp: Rethinking Video Compression Through Informational Uniqueness"
☆27 · Feb 22, 2026 · Updated 4 months ago
LAMDA-CL / Prism
Prism: A Plug-in Reproducible Infrastructure for Scalable Multimodal Continual Instruction Tuning
☆34 · Jun 15, 2026 · Updated last month
DunZhang / Jasper-Token-Compression-Training
The training code for Jasper-Token-Compression-600M
☆20 · Nov 19, 2025 · Updated 8 months ago
tdemin16 / multi-lane
Official Implementation of MULTI-LANE (Multi Label class incremental learning via summarising pAtch tokeN Embeddings). Published in 3rd C…
☆15 · Feb 20, 2025 · Updated last year