ahnobari/ActivationInformedMerging

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ahnobari/ActivationInformedMerging)

ahnobari / ActivationInformedMerging

Official repository for Activation-Informed Merging (AIM) of Large Language Models

☆24

Alternatives and similar repositories for ActivationInformedMerging

Users that are interested in ActivationInformedMerging are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

duguodong7 / pcb-merging
View on GitHub
[NeurIPS 2024] For paper Parameter Competition Balancing for Model Merging
☆48Oct 11, 2024Updated last year
EnnengYang / Efficient-WEMoE
View on GitHub
Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.
☆16Oct 28, 2024Updated last year
nathanielyvo / WUDI-Merging
View on GitHub
The official repository of "Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors""
☆50Oct 1, 2025Updated 9 months ago
apanariello4 / core-space-merging
View on GitHub
Pytorch code for NeurIPS 2025 paper "Accurate and Efficient Low-Rank Model Merging in Core Space"
☆41Feb 2, 2026Updated 5 months ago
uiuctml / MergeBench
View on GitHub
[NeurIPS 2025] MergeBench: A Benchmark for Merging Domain-Specialized LLMs
☆47Feb 11, 2026Updated 5 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
WalkerWorldPeace / DOGE
View on GitHub
Official implementation of "Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent".
☆23May 23, 2025Updated last year
danielm1405 / iso-merging
View on GitHub
[ICML 2025] No Task Left Behind: Isotropic Model Merging with Common and Task-Specific Subspaces (official repository)
☆46Aug 7, 2025Updated 11 months ago
declare-lab / della
View on GitHub
DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling
☆37Jul 12, 2024Updated 2 years ago
EnnengYang / Awesome-Model-Merging-Methods-Theories-Applications
View on GitHub
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2026.
☆769Jul 17, 2026Updated last week
hahahawu / Long-to-Short-via-Model-Merging
View on GitHub
Model merging is a highly efficient approach for long-to-short reasoning.
☆103Oct 15, 2025Updated 9 months ago
Xu0615 / FinetuneCircuits
View on GitHub
A Mechanistic‑Interpretability study that finds the structural dynamics of Large Language Models under fine‑tuning.
☆17May 30, 2025Updated last year
Hesse73 / RLVR-Directions
View on GitHub
Source Code for our ICLR'26 paper
☆17Feb 22, 2026Updated 5 months ago
luli-git / MAP
View on GitHub
MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation
☆18Sep 2, 2024Updated last year
chili-lab / Spherical-Steering
View on GitHub
[ICML 2026] Spherical Steering: Geometry-Aware Activation Rotation for Language Models
☆17May 19, 2026Updated 2 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
adithya-s-k / MoLE
View on GitHub
Mixture of Lora Experts
☆11Apr 7, 2024Updated 2 years ago
aimagelab / TransFusion
View on GitHub
Official codebase of "Update Your Transformer to the Latest Release: Re-Basin of Task Vectors" - ICML 2025
☆23Jul 30, 2025Updated 11 months ago
xiangchi-yuan / mrl
View on GitHub
☆15Apr 6, 2026Updated 3 months ago
gstoica27 / KnOTS
View on GitHub
Model Merging with SVD to Tie the KnOTS [ICLR 2025]
☆94Apr 3, 2025Updated last year
ZIB-IOL / SMS
View on GitHub
Code to reproduce the experiments of the ICLR24-paper: "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging"
☆12Oct 14, 2025Updated 9 months ago
VITA-Group / Robust_Weight_Signatures
View on GitHub
[ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang
☆16May 4, 2023Updated 3 years ago
zzp1012 / Cross-Task-Linearity
View on GitHub
[ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm"
☆11Feb 20, 2025Updated last year
harveyhuang18 / EMR_Merging
View on GitHub
[NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging
☆83Mar 1, 2025Updated last year
probabilistic-inference-scaling / probabilistic-inference-scaling
View on GitHub
☆52Mar 17, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
AIM-SKKU / RA-Touch
View on GitHub
RA-Touch: Retrieval-Augmented Touch Understanding with Enriched Visual Data (ACM MM '25)
☆15Sep 12, 2025Updated 10 months ago
tootouch / VPT
View on GitHub
Unofficial Visual Prompt Tuning implementation
☆17May 22, 2023Updated 3 years ago
colehawkins / bayesian-tensor-rank-determination
View on GitHub
☆13Dec 17, 2021Updated 4 years ago
FightingFighting / GPS
View on GitHub
This is the repository for paper: Gradient-based Parameter Selection for Efficient Fine-Tuning
☆30Nov 18, 2025Updated 8 months ago
Red-Hat-AI-Innovation-Team / mini_trainer
View on GitHub
fast trainer for educational purposes
☆26Updated this week
duguodong7 / Awesome-Knowledge-Fusion
View on GitHub
A collection of papers related to knowledge fusion
☆58Oct 11, 2024Updated last year
jaeho-lee / oce
View on GitHub
Codes for "Learning bounds for risk-sensitive learning," NeurIPS 2020 (or see arXiv 2006.08138)
☆11Oct 15, 2020Updated 5 years ago
shawnricecake / squant
View on GitHub
[ICCAD 2025] Squant
☆15Jul 3, 2025Updated last year
EMI-Group / tensoraco
View on GitHub
GPU-accelerated Ant Colony Optimization (ACO)
☆18Feb 28, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
StriveZs / MSA-Conv
View on GitHub
TiC: Exploring Vision Transformer in Convolution
☆11Oct 24, 2023Updated 2 years ago
FrankFundel / SGCond
View on GitHub
☆10Jun 28, 2023Updated 3 years ago
jo1jun / Vision_Transformer
View on GitHub
☆18May 16, 2021Updated 5 years ago
AlphaLab-USTC / LRM-plans-CoT
View on GitHub
[NeurIPS 2025] The implementation of paper "On Reasoning Strength Planning in Large Reasoning Models"
☆31Jul 6, 2025Updated last year
shiqichen17 / VLM_Merging
View on GitHub
Github repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025)
☆89Jun 9, 2026Updated last month
Red-Hat-AI-Innovation-Team / SQuat
View on GitHub
☆22Jun 5, 2025Updated last year
prateeky2806 / ties-merging
View on GitHub
☆216Feb 3, 2024Updated 2 years ago