UKPLab/iclr2024-model-merging

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/UKPLab/iclr2024-model-merging)

UKPLab / iclr2024-model-merging

This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.

☆31

Alternatives and similar repositories for iclr2024-model-merging

Users that are interested in iclr2024-model-merging are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kyrie-23 / linear_task_arithmetic
View on GitHub
☆12Jul 30, 2025Updated 11 months ago
team-approx-bayes / bayesian-sam
View on GitHub
Code for "SAM as an Optimal Relaxation of Bayes", ICLR 2023.
☆26Jul 21, 2023Updated 3 years ago
MaximeRobeyns / bayesian_lora
View on GitHub
Bayesian Low-Rank Adaptation for Large Language Models
☆41Jun 22, 2024Updated 2 years ago
tanganke / peta
View on GitHub
Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"
☆26Sep 13, 2024Updated last year
neale / HyperGAN
View on GitHub
Generative Model for Neural Networks
☆24Jul 2, 2020Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
gortizji / tangent_task_arithmetic
View on GitHub
Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".
☆113Jun 8, 2023Updated 3 years ago
thutzr / GLIME-General-Stable-and-Local-LIME-Explanation
View on GitHub
GLIME is a post-hoc explanation method which is proved to be much more stable and faithful than LIME.
☆16Oct 15, 2024Updated last year
EnnengYang / AdaMerging
View on GitHub
AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.
☆114Oct 28, 2024Updated last year
mlfoundations / task_vectors
View on GitHub
Editing Models with Task Arithmetic
☆548Jan 11, 2024Updated 2 years ago
Thinklab-SJTU / GAMF
View on GitHub
☆19Feb 15, 2023Updated 3 years ago
jinlanfu / Polyglot_Prompt
View on GitHub
Code and dataset for Polyglot Prompting: Multilingual Multitask Prompt Training.
☆18Dec 7, 2022Updated 3 years ago
prateeky2806 / ties-merging
View on GitHub
☆217Feb 3, 2024Updated 2 years ago
fbarez / neuroplasticity
View on GitHub
☆14Mar 31, 2024Updated 2 years ago
r-three / mats
View on GitHub
☆33Jul 8, 2024Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
fjzzq2002 / random_transformers
View on GitHub
Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024)
☆15Sep 28, 2024Updated last year
yifan-h / Multilingual_Space
View on GitHub
Source Code for "Adapters for Enhanced Modeling of Multilingual Knowledge and Text"
☆12Oct 28, 2022Updated 3 years ago
deeplearning-wisc / tsv
View on GitHub
☆30Jul 17, 2025Updated last year
team-approx-bayes / kpriors
View on GitHub
Code for Knowledge-Adaptation Priors based on the NeurIPS 2021 paper by Khan and Swaroop.
☆17Feb 1, 2022Updated 4 years ago
tum-vision / sublabel_relax
View on GitHub
Code for sublabel-accurate multi-labeling papers (published at CVPR '16, ECCV '16)
☆20Oct 11, 2016Updated 9 years ago
ethz-spylab / superhuman-ai-consistency
View on GitHub
☆30Jun 19, 2023Updated 3 years ago
faridlazuarda / cultural-llm-papers
View on GitHub
A curated list of research papers and resources on Cultural LLM.
☆53Sep 26, 2024Updated last year
StanfordASL / RSIRL
View on GitHub
Risk-sensitive Inverse Reinforcement Learning
☆11Sep 11, 2019Updated 6 years ago
remigenet / SigKAN
View on GitHub
SigKAN: Signature-Weighted Kolmogorov-Arnold Networks for Time Series
☆49Nov 24, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
EnnengYang / Awesome-Model-Merging-Methods-Theories-Applications
View on GitHub
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2026.
☆769Updated this week
EnnengYang / Efficient-WEMoE
View on GitHub
Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.
☆16Oct 28, 2024Updated last year
starrYYxuan / UniTE
View on GitHub
☆17Nov 20, 2024Updated last year
zjunlp / PitfallsKnowledgeEditing
View on GitHub
[ICLR 2024] Unveiling the Pitfalls of Knowledge Editing for Large Language Models
☆22Jun 13, 2024Updated 2 years ago
JOHNNY-fans / RankNorm
View on GitHub
☆13Feb 21, 2025Updated last year
PlusLabNLP / Active-IT
View on GitHub
Code for our EMNLP-2023 paper: "Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks"
☆26Nov 16, 2023Updated 2 years ago
MaheepChaudhary / SAE-Ravel
View on GitHub
Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the p…
☆13Jan 26, 2025Updated last year
fredzzhang / atlas
View on GitHub
[NeurIPS'24] Official PyTorch implementation for paper "Knowledge Composition using Task Vectors with Learned Anisotropic Scaling"
☆28Feb 24, 2025Updated last year
ganler / ResearchReading
View on GitHub
General system research material (not limited to paper) reading notes.
☆22Mar 17, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
HanSolo9682 / CounterCurate
View on GitHub
This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.
☆19Jun 27, 2024Updated 2 years ago
yule-BUAA / MergeLM
View on GitHub
Codebase for Merging Language Models (ICML 2024)
☆870May 5, 2024Updated 2 years ago
TraceElephant / TraceElephant
View on GitHub
Repo of "Seeing the Whole Elephant: A Benchmark for Failure Attribution in LLM-based Multi-Agent Systems" (ACL 2026)
☆16Apr 27, 2026Updated 3 months ago
VITA-Group / Robust_Weight_Signatures
View on GitHub
[ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang
☆16May 4, 2023Updated 3 years ago
zzp1012 / Cross-Task-Linearity
View on GitHub
[ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm"
☆11Feb 20, 2025Updated last year
ByteDance-Seed / DATAMASK
View on GitHub
Joint Selection for Large-Scale Pre-Training Data via Policy Gradient-based Mask Learning
☆21Jan 4, 2026Updated 6 months ago
Tikquuss / meta_XLM
View on GitHub
Cross-lingual Language Model (XLM) pretraining and Model-Agnostic Meta-Learning (MAML) for fast adaptation of deep networks
☆20Mar 26, 2021Updated 5 years ago