The official repository of "Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors""
☆50Oct 1, 2025Updated 8 months ago
Alternatives and similar repositories for WUDI-Merging
Users that are interested in WUDI-Merging are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ICLR 2026 "OptMerge: Unifying Multimodal LLM Capabilities and Modalities via Model Merging".☆52May 31, 2026Updated 2 weeks ago
- Official implementation of "Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent".☆23May 23, 2025Updated last year
- [NeurIPS 2024] For paper Parameter Competition Balancing for Model Merging☆48Oct 11, 2024Updated last year
- Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.☆16Oct 28, 2024Updated last year
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion☆229Updated this week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"☆25Oct 11, 2025Updated 8 months ago
- Task Singular Vectors: Reducing Task Interference in Model Merging. Merge models avoiding task interference through separable models.☆55Dec 15, 2025Updated 5 months ago
- [ICML 2025] No Task Left Behind: Isotropic Model Merging with Common and Task-Specific Subspaces (official repository)☆45Aug 7, 2025Updated 10 months ago
- Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic☆32Feb 18, 2026Updated 3 months ago
- ☆215Feb 3, 2024Updated 2 years ago
- A user-friendly & efficient knowledge distillation framework for LLMs, supporting off-policy, on-policy (OPD), cross-tokenizer, multimoda…☆195Jun 5, 2026Updated last week
- A Mechanistic‑Interpretability study that finds the structural dynamics of Large Language Models under fine‑tuning.☆17May 30, 2025Updated last year
- Model merging is a highly efficient approach for long-to-short reasoning.☆102Oct 15, 2025Updated 8 months ago
- Mixture of Lora Experts☆11Apr 7, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official code for our paper "Model Composition for Multimodal Large Language Models" (ACL 2024)☆31Jan 8, 2025Updated last year
- [ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm"☆11Feb 20, 2025Updated last year
- This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration.☆13Apr 9, 2024Updated 2 years ago
- ☆56Nov 6, 2024Updated last year
- Repository of GUI Action Narrator☆13Apr 8, 2025Updated last year
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆53Dec 22, 2025Updated 5 months ago
- Code and Datasets for "Unifying Multi-associations through Hypergraph for Bundle Recommendation"☆11Oct 1, 2022Updated 3 years ago
- This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.☆31May 15, 2024Updated 2 years ago
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"☆26Sep 13, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Unofficial Visual Prompt Tuning implementation☆17May 22, 2023Updated 3 years ago
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.☆22Sep 24, 2025Updated 8 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆92Feb 14, 2025Updated last year
- ☆12Feb 11, 2026Updated 4 months ago
- Official codebase of "Update Your Transformer to the Latest Release: Re-Basin of Task Vectors" - ICML 2025☆23Jul 30, 2025Updated 10 months ago
- Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision☆11Jul 22, 2024Updated last year
- Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)☆22Oct 16, 2025Updated 7 months ago
- This project uses yolov3 + crnn algorithm to recognize license plate, pytorch☆18Jun 22, 2022Updated 3 years ago
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2026.☆754Updated this week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [EMNLP25] Official code for "POSITION BIAS MITIGATES POSITION BIAS: Mitigate Position Bias Through Inter-Position Knowledge Distillation…☆38Nov 11, 2025Updated 7 months ago
- Question-Aware Gaussian Experts for Audio-Visual Question Answering -- Official Pytorch Implementation (CVPR'25, Highlight)☆29Jun 6, 2025Updated last year
- ☆27Oct 12, 2024Updated last year
- 珠算代码大模型(Abacus Code LLM)☆58Sep 26, 2024Updated last year
- Implementation of Adversarial Privacy Graph Embedding in TensorFlow☆21Jun 12, 2020Updated 6 years ago
- experimental implementation of Consistory☆20Jul 15, 2024Updated last year
- Code and dataset for NAACL 2022 paper "CoSIm: Commonsense Reasoning for Counterfactual Scene Imagination" Hyounghun Kim, Abhay Zala, Mohi…☆16Nov 26, 2022Updated 3 years ago