Official implementation of "OptMerge: Unifying Multimodal LLM Capabilities and Modalities via Model Merging".
☆50Oct 30, 2025Updated 5 months ago
Alternatives and similar repositories for MLLMerging
Users that are interested in MLLMerging are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official repository of "Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors""☆49Oct 1, 2025Updated 6 months ago
- [CVPR 2025] LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs☆14Jun 20, 2025Updated 10 months ago
- [ICML 2024] CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers☆34Dec 30, 2024Updated last year
- Code to reproduce the experiments of the ICLR24-paper: "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging"☆12Oct 14, 2025Updated 6 months ago
- Code for Tangent Model Composition for Ensembling and Continual Fine-tuning (ICCV 2023) and Tangent Transformers for Composition, Privacy…☆14May 14, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code of Data-Free Knowledge Distillation via Feature Exchange and Activation Region Constraint☆21Oct 23, 2023Updated 2 years ago
- This is the official code for the paper "Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable".☆33Mar 11, 2025Updated last year
- ☆16May 15, 2025Updated 11 months ago
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2026.☆720Updated this week
- ☆10Jun 28, 2024Updated last year
- Stepping VLMs onto the Court: Benchmarking Spatial Intelligence in Sports☆68Mar 15, 2026Updated last month
- [ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding☆99Apr 1, 2025Updated last year
- Zero-shot Learning by Generating Task-specific Adapters☆14Apr 2, 2021Updated 5 years ago
- [NeurIPS 2024] The official repository of "Distribution-Aware Data Expansion with Diffusion Models".☆16Dec 15, 2025Updated 4 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆112Jun 8, 2023Updated 2 years ago
- [NeurIPS 2025] MergeBench: A Benchmark for Merging Domain-Specialized LLMs☆46Feb 11, 2026Updated 2 months ago
- ☆11Sep 1, 2024Updated last year
- Export Notion page to markdown format file☆10May 10, 2021Updated 4 years ago
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"☆10Jul 1, 2024Updated last year
- Official This-Is-My Dataset published in CVPR 2023☆16Jul 18, 2024Updated last year
- Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs☆13Feb 13, 2024Updated 2 years ago
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"☆11Oct 27, 2025Updated 6 months ago
- A simple Mesh Library written in C☆13Apr 3, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code for the paper: Prompts have evil twins (EMNLP 2024)☆24Feb 10, 2025Updated last year
- ☆15Jul 1, 2024Updated last year
- Codes for Merging Large Language Models☆35Aug 7, 2024Updated last year
- repo for Tibetan corpora☆23Apr 10, 2023Updated 3 years ago
- Code for the paper "A Boolean Task Algebra For Reinforcement Learning"☆11Dec 8, 2022Updated 3 years ago
- [ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"☆22Mar 8, 2026Updated last month
- Code for our EMNLP '22 paper "Fixing Model Bugs with Natural Language Patches"☆19Dec 7, 2022Updated 3 years ago
- JPEG编解码从零开始实现(python JPEG codec)☆10Jul 29, 2022Updated 3 years ago
- An event based dataset loader under one common python API.☆10Mar 22, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [NDSS 2025] CENSOR: Defense Against Gradient Inversion via Orthogonal Subspace Bayesian Sampling☆17Jan 18, 2025Updated last year
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation☆16Sep 2, 2024Updated last year
- [ICML2023] Revisiting Data-Free Knowledge Distillation with Poisoned Teachers☆23Jul 7, 2024Updated last year
- LocalHost of PIA in Windows☆13Dec 25, 2023Updated 2 years ago
- The collection of papers about Private Evolution☆18Mar 23, 2026Updated last month
- 天池上的一场长期赛(心跳信号分类预测),非常简单朴素的实现,长期赛榜单第8名(258.9817分)☆17Jan 30, 2022Updated 4 years ago
- Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"☆14Mar 28, 2024Updated 2 years ago