ICLR 2026 "OptMerge: Unifying Multimodal LLM Capabilities and Modalities via Model Merging".
☆50May 31, 2026Updated last week
Alternatives and similar repositories for MLLMerging
Users that are interested in MLLMerging are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official repository of "Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors""☆50Oct 1, 2025Updated 8 months ago
- [ICML 2024] CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers☆34Dec 30, 2024Updated last year
- Code to reproduce the experiments of the ICLR24-paper: "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging"☆12Oct 14, 2025Updated 7 months ago
- Code for Tangent Model Composition for Ensembling and Continual Fine-tuning (ICCV 2023) and Tangent Transformers for Composition, Privacy…☆14May 14, 2024Updated 2 years ago
- Exploring Model Kinship for Merging Large Language Models☆28Apr 16, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code of Data-Free Knowledge Distillation via Feature Exchange and Activation Region Constraint☆21Oct 23, 2023Updated 2 years ago
- Offical respority for Gait Recogniton with Drones: A benchmark (TMM 2023)☆10Feb 2, 2024Updated 2 years ago
- 利用大语言模型进行卧底游戏,包括谁是卧底及衍生的发现AI卧底游戏等。☆11Sep 6, 2024Updated last year
- ☆16May 15, 2025Updated last year
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"☆26Sep 13, 2024Updated last year
- ☆10Jun 28, 2024Updated last year
- [ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding☆100Apr 1, 2025Updated last year
- The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]☆16Sep 12, 2025Updated 8 months ago
- [NeurIPS 2024] The official repository of "Distribution-Aware Data Expansion with Diffusion Models".☆17Dec 15, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆113Jun 8, 2023Updated 3 years ago
- 基于PC-DDSP和nsf-HiFiGAN的声码器☆18Jul 17, 2023Updated 2 years ago
- [NeurIPS 2025] MergeBench: A Benchmark for Merging Domain-Specialized LLMs☆47Feb 11, 2026Updated 3 months ago
- The code repository for "Multi-layer Rehearsal Feature Augmentation for Class-Incremental Learning" (ICML24)☆12Jun 7, 2024Updated 2 years ago
- 2019~2021年间Zero-shot/Data-free知识蒸馏的论文合集☆11Sep 8, 2021Updated 4 years ago
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"☆10Jul 1, 2024Updated last year
- Official This-Is-My Dataset published in CVPR 2023☆16Jul 18, 2024Updated last year
- Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs☆13Feb 13, 2024Updated 2 years ago
- ☆17Oct 7, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A simple Mesh Library written in C☆13Apr 3, 2023Updated 3 years ago
- Code for the paper: Prompts have evil twins (EMNLP 2024)☆24Feb 10, 2025Updated last year
- Codes for Merging Large Language Models☆37Aug 7, 2024Updated last year
- Code for the paper "A Boolean Task Algebra For Reinforcement Learning"☆11Dec 8, 2022Updated 3 years ago
- [ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"☆25Mar 8, 2026Updated 3 months ago
- [NeurIPS 2024] For paper Parameter Competition Balancing for Model Merging☆48Oct 11, 2024Updated last year
- ☆14May 5, 2022Updated 4 years ago
- The official implementation of "Low-power, Continuous Remote Behavioral Localization with Event Cameras" (CVPR 2024)☆12Sep 25, 2024Updated last year
- Utilities to parse type information and JSDoc annotations from TypeScript source files, and render Markdown documentation☆12Jun 24, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- JPEG编解码从零开始实现(python JPEG codec)☆10Jul 29, 2022Updated 3 years ago
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation☆17Sep 2, 2024Updated last year
- ☆16Jul 1, 2024Updated last year
- [ICML2023] Revisiting Data-Free Knowledge Distillation with Poisoned Teachers☆24Jul 7, 2024Updated last year
- AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation☆17Aug 3, 2025Updated 10 months ago
- The collection of papers about Private Evolution☆18Mar 23, 2026Updated 2 months ago
- [ECCV24] The official code repository for paper "Training-Free Model Merging for Multi-target Domain Adaptation".☆18Sep 27, 2024Updated last year