WalkerWorldPeace/DOGE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/WalkerWorldPeace/DOGE)

WalkerWorldPeace / DOGE

Official implementation of "Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent".

☆23

Alternatives and similar repositories for DOGE

Users that are interested in DOGE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nathanielyvo / WUDI-Merging
View on GitHub
The official repository of "Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors""
☆50Oct 1, 2025Updated 9 months ago
WalkerWorldPeace / MLLMerging
View on GitHub
ICLR 2026 "OptMerge: Unifying Multimodal LLM Capabilities and Modalities via Model Merging".
☆57Jun 18, 2026Updated last month
Egg-Hu / Awesome-Synthetic-Data-Generation
View on GitHub
☆19Jan 7, 2026Updated 6 months ago
tanganke / opcm
View on GitHub
official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"
☆25Oct 11, 2025Updated 9 months ago
xiangchi-yuan / mrl
View on GitHub
☆15Apr 6, 2026Updated 3 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
tanganke / weight-ensembling_MoE
View on GitHub
Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"
☆32Jun 7, 2024Updated 2 years ago
Egg-Hu / LoRA-Recycle
View on GitHub
[CVPR 2025] LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs
☆14Jun 20, 2025Updated last year
ZIB-IOL / SMS
View on GitHub
Code to reproduce the experiments of the ICLR24-paper: "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging"
☆12Oct 14, 2025Updated 9 months ago
hahahawu / Long-to-Short-via-Model-Merging
View on GitHub
Model merging is a highly efficient approach for long-to-short reasoning.
☆103Oct 15, 2025Updated 9 months ago
EnnengYang / Awesome-Model-Merging-Methods-Theories-Applications
View on GitHub
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2026.
☆769Updated this week
floatingsun / transformer_layers_as_painters
View on GitHub
transformer layers behavior as painters🧑‍🎨
☆15May 6, 2025Updated last year
EnnengYang / Efficient-WEMoE
View on GitHub
Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.
☆16Oct 28, 2024Updated last year
tianyu139 / tangent-model-composition
View on GitHub
Code for Tangent Model Composition for Ensembling and Continual Fine-tuning (ICCV 2023) and Tangent Transformers for Composition, Privacy…
☆14May 14, 2024Updated 2 years ago
AuroraZengfh / RobustMerge
View on GitHub
[NeurIPS'25 Spotlight🔥] Official Implementation of RobustMerge: Parameter-Efficient Model Merging for MLLMs with Direction Robustness
☆67Jun 24, 2026Updated 3 weeks ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
apanariello4 / core-space-merging
View on GitHub
Pytorch code for NeurIPS 2025 paper "Accurate and Efficient Low-Rank Model Merging in Core Space"
☆41Feb 2, 2026Updated 5 months ago
skgyu / SpaceshipNet
View on GitHub
Code of Data-Free Knowledge Distillation via Feature Exchange and Activation Region Constraint
☆21Oct 23, 2023Updated 2 years ago
THUNLP-MT / ModelCompose
View on GitHub
Official code for our paper "Model Composition for Multimodal Large Language Models" (ACL 2024)
☆31Jan 8, 2025Updated last year
VITA-Group / Robust_Weight_Signatures
View on GitHub
[ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang
☆16May 4, 2023Updated 3 years ago
zzp1012 / Cross-Task-Linearity
View on GitHub
[ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm"
☆11Feb 20, 2025Updated last year
zwhe99 / RaSA
View on GitHub
[ICLR 2025] RaSA: Rank-Sharing Low-Rank Adaptation
☆10May 19, 2025Updated last year
zju-vipa / training_free_model_merging
View on GitHub
This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).
☆34Mar 5, 2024Updated 2 years ago
Zhengsh123 / FREE-Merging
View on GitHub
The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)
☆16Jun 26, 2025Updated last year
AIM-SKKU / RA-Touch
View on GitHub
RA-Touch: Retrieval-Augmented Touch Understanding with Enriched Visual Data (ACM MM '25)
☆15Sep 12, 2025Updated 10 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
hyhuang00 / moe_inference
View on GitHub
Code Repository for the NeurIPS 2024 Paper "Toward Efficient Inference for Mixture of Experts".
☆19Oct 30, 2024Updated last year
LTS5 / ReservoirTTA
View on GitHub
[preprint] ReservoirTTA: Prolonged Test-time Adaptation for Evolving and Recurring Domains
☆15Aug 20, 2025Updated 11 months ago
LAMDA-CL / ICCV2025-TUNA
View on GitHub
Integrating Task-Specific and Universal Adapters for Pre-Trained Model-based Class-Incremental Learning (ICCV 2025)
☆18Sep 23, 2025Updated 9 months ago
iclr2024mcmi / ICLRMCMI
View on GitHub
Official implementation of Bayes Conditional Distribution Estimation for Knowledge Distillation Based on Conditional Mutual Information
☆12Sep 28, 2023Updated 2 years ago
weixuan-wang123 / ReMaKE
View on GitHub
☆14Sep 1, 2025Updated 10 months ago
Xu0615 / FinetuneCircuits
View on GitHub
A Mechanistic‑Interpretability study that finds the structural dynamics of Large Language Models under fine‑tuning.
☆17May 30, 2025Updated last year
IST-DASLab / HALO
View on GitHub
HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation of https://arx…
☆31Feb 17, 2025Updated last year
LZY-the-boys / Twin-Merging
View on GitHub
[NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging
☆144Mar 17, 2025Updated last year
antgroup / importance-aware-sparse-tuning-IST-paper
View on GitHub
☆22Dec 23, 2024Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
gortizji / tangent_task_arithmetic
View on GitHub
Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".
☆114Jun 8, 2023Updated 3 years ago
GangweiJiang / FvForgetting
View on GitHub
☆15Apr 20, 2025Updated last year
scarlet0703 / LoRA-Sub-DRS
View on GitHub
Official PyTorch implementation of our CVPR 2025 paper, "LoRA Subtraction for Drift-Resistant Space in Exemplar-Free Continual Learning."
☆18Mar 28, 2025Updated last year
lliai / DisWOT-CVPR2023
View on GitHub
☆28Nov 29, 2023Updated 2 years ago
aromanusc / SoundQ
View on GitHub
Enhanced sound event localization and detection in real 360-degree audio-visual soundscapes (DCASE task3 format)
☆14Mar 21, 2025Updated last year
Robiwan245 / SiamMAE
View on GitHub
☆12Mar 5, 2024Updated 2 years ago
nik-dim / tall_masks
View on GitHub
Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]
☆53Dec 22, 2025Updated 6 months ago