The official repository of "Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors""
☆49Oct 1, 2025Updated 5 months ago
Alternatives and similar repositories for WUDI-Merging
Users that are interested in WUDI-Merging are comparing it to the libraries listed below
Sorting:
- Official implementation of "Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent".☆21May 23, 2025Updated 9 months ago
- Official repository for Activation-Informed Merging (AIM) of Large Language Models☆21Feb 10, 2025Updated last year
- [NeurIPS 2024] For paper Parameter Competition Balancing for Model Merging☆48Oct 11, 2024Updated last year
- Official implementation of "OptMerge: Unifying Multimodal LLM Capabilities and Modalities via Model Merging".☆43Oct 30, 2025Updated 4 months ago
- Code for Research Project TLDR☆25Jul 28, 2025Updated 7 months ago
- Task Singular Vectors: Reducing Task Interference in Model Merging. Merge models avoiding task interference through separable models.☆49Dec 15, 2025Updated 2 months ago
- official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"☆23Oct 11, 2025Updated 4 months ago
- Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.☆16Oct 28, 2024Updated last year
- [ICML 2025] No Task Left Behind: Isotropic Model Merging with Common and Task-Specific Subspaces (official repository)☆38Aug 7, 2025Updated 6 months ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆52Dec 22, 2025Updated 2 months ago
- Representation Surgery for Multi-Task Model Merging. ICML, 2024.☆47Oct 10, 2024Updated last year
- ☆210Feb 3, 2024Updated 2 years ago
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"☆24Sep 13, 2024Updated last year
- Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"☆31Jun 7, 2024Updated last year
- This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.☆29May 15, 2024Updated last year
- Official code for our paper "Model Composition for Multimodal Large Language Models" (ACL 2024)☆31Jan 8, 2025Updated last year
- Model merging is a highly efficient approach for long-to-short reasoning.☆100Oct 15, 2025Updated 4 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆92Feb 14, 2025Updated last year
- ☆37Jan 25, 2026Updated last month
- DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling☆36Jul 12, 2024Updated last year
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2026.☆680Updated this week
- 2019~2021年间Zero-shot/Data-free知识蒸馏的论文合集☆11Sep 8, 2021Updated 4 years ago
- Fully open reproduction of DeepSeek-R1☆12Mar 24, 2025Updated 11 months ago
- 研究生课程笔记。包含组合数学、高级算法设计与分析、最优化理论与应用、大数据分析与挖掘。☆15Dec 17, 2023Updated 2 years ago
- [ACL 2023] Multi-source Semantic Graph-based Multimodal Sarcasm Explanation Generation.☆10Dec 19, 2024Updated last year
- ☆15Jul 22, 2024Updated last year
- ☆18Jun 23, 2025Updated 8 months ago
- code for "Combating Noise: Semi-supervised Learning by Region Uncertainty Quantification"☆10Mar 19, 2022Updated 3 years ago
- JPEG编解码从零开始实现(python JPEG codec)☆10Jul 29, 2022Updated 3 years ago
- ☆13Jul 15, 2025Updated 7 months ago
- ☆13Jul 15, 2024Updated last year
- ☆10Dec 26, 2023Updated 2 years ago
- ☆14Sep 1, 2025Updated 6 months ago
- lol助手秒选亚索☆12Jun 12, 2022Updated 3 years ago
- An event based dataset loader under one common python API.☆10Mar 22, 2022Updated 3 years ago
- This is the open-source code for TokenCarve.☆23Jan 23, 2026Updated last month
- Code to reproduce the experiments of the ICLR24-paper: "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging"☆12Oct 14, 2025Updated 4 months ago
- [AAAI 2024] History Matters: Temporal Knowledge Editing in Large Language Model☆14Dec 17, 2023Updated 2 years ago
- A pipeline for the automatic construction of geometry problems along with step-by-step solutions.☆17Aug 27, 2025Updated 6 months ago