nik-dim/tall_masks

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nik-dim/tall_masks)

nik-dim / tall_masks

Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]

☆53

Alternatives and similar repositories for tall_masks

Users that are interested in tall_masks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wang-kee / LiNeS
View on GitHub
Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"
☆31Nov 4, 2024Updated last year
EnnengYang / RepresentationSurgery
View on GitHub
Representation Surgery for Multi-Task Model Merging. ICML, 2024.
☆49Oct 10, 2024Updated last year
uiuctml / Localize-and-Stitch
View on GitHub
Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic
☆32Feb 18, 2026Updated 5 months ago
duguodong7 / pcb-merging
View on GitHub
[NeurIPS 2024] For paper Parameter Competition Balancing for Model Merging
☆48Oct 11, 2024Updated last year
david3684 / AdaRank
View on GitHub
Official codebase for AdaRank: Adaptive Rank Pruning for Enhanced Model Merging (ICLR 2026)
☆20Jan 26, 2026Updated 6 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
prateeky2806 / ties-merging
View on GitHub
☆217Feb 3, 2024Updated 2 years ago
AntoAndGar / task_singular_vectors
View on GitHub
Task Singular Vectors: Reducing Task Interference in Model Merging. Merge models avoiding task interference through separable models.
☆57Dec 15, 2025Updated 7 months ago
harveyhuang18 / EMR_Merging
View on GitHub
[NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging
☆82Mar 1, 2025Updated last year
r-three / mats
View on GitHub
☆33Jul 8, 2024Updated 2 years ago
epfml / REQ
View on GitHub
☆19Jun 10, 2024Updated 2 years ago
mlfoundations / task_vectors
View on GitHub
Editing Models with Task Arithmetic
☆548Jan 11, 2024Updated 2 years ago
danielm1405 / iso-merging
View on GitHub
[ICML 2025] No Task Left Behind: Isotropic Model Merging with Common and Task-Specific Subspaces (official repository)
☆46Aug 7, 2025Updated 11 months ago
uiuctml / GOAT
View on GitHub
[JMLR] Gradual Domain Adaptation: Theory and Algorithms
☆11Jan 14, 2025Updated last year
luli-git / MAP
View on GitHub
MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation
☆18Sep 2, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
davidstutz / robust-generalization-flatness
View on GitHub
Implementation of average- and worst-case robust flatness measures for adversarial training.
☆15Nov 5, 2021Updated 4 years ago
yule-BUAA / MergeLM
View on GitHub
Codebase for Merging Language Models (ICML 2024)
☆870May 5, 2024Updated 2 years ago
gstoica27 / KnOTS
View on GitHub
Model Merging with SVD to Tie the KnOTS [ICLR 2025]
☆94Apr 3, 2025Updated last year
locuslab / acr-memorization
View on GitHub
☆41Dec 19, 2024Updated last year
tanganke / fusion_bench
View on GitHub
FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion
☆236Jun 23, 2026Updated last month
erosenfeld / disagree_discrep
View on GitHub
Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.
☆10Feb 27, 2024Updated 2 years ago
epfml / halluhard
View on GitHub
A Hard Multi-Turn Hallucination Benchmark
☆34Jul 7, 2026Updated 3 weeks ago
yeonwoo378 / flowbind
View on GitHub
We propose an efficient flow-based multimodal generation model with bidirectional flows.
☆16Feb 18, 2026Updated 5 months ago
yule-BUAA / MergeLLM
View on GitHub
Codes for Merging Large Language Models
☆37Aug 7, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
zju-vipa / training_free_model_merging
View on GitHub
This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).
☆34Mar 5, 2024Updated 2 years ago
kietngt00 / UFC
View on GitHub
[NeurIPS 2025] Universal Few-Shot Spatial Control for Diffusion Models
☆21Sep 18, 2025Updated 10 months ago
alexrame / diwa
View on GitHub
DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization
☆31Jan 31, 2023Updated 3 years ago
locuslab / T-MARS
View on GitHub
Code for T-MARS data filtering
☆35Aug 23, 2023Updated 2 years ago
r-three / realistic_evaluation_of_model_merging_for_compositional_generalization
View on GitHub
☆13Feb 11, 2026Updated 5 months ago
uiuctml / MergeBench
View on GitHub
[NeurIPS 2025] MergeBench: A Benchmark for Merging Domain-Specialized LLMs
☆47Feb 11, 2026Updated 5 months ago
kyrie-23 / linear_task_arithmetic
View on GitHub
☆12Jul 30, 2025Updated 11 months ago
UKPLab / iclr2024-model-merging
View on GitHub
This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.
☆31May 15, 2024Updated 2 years ago
tml-epfl / sharpness-vs-generalization
View on GitHub
A modern look at the relationship between sharpness and generalization [ICML 2023]
☆44Sep 11, 2023Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
VITA-Group / Robust_Weight_Signatures
View on GitHub
[ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang
☆16May 4, 2023Updated 3 years ago
declare-lab / della
View on GitHub
DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling
☆37Jul 12, 2024Updated 2 years ago
SJTU-DeepVisionLab / FLoRA
View on GitHub
☆45Jul 22, 2024Updated 2 years ago
mmatena / model_merging
View on GitHub
☆81Mar 17, 2022Updated 4 years ago
nathanielyvo / WUDI-Merging
View on GitHub
The official repository of "Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors""
☆50Oct 1, 2025Updated 9 months ago
gortizji / tangent_task_arithmetic
View on GitHub
Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".
☆113Jun 8, 2023Updated 3 years ago
MadryLab / bias-transfer
View on GitHub
☆15Jul 24, 2022Updated 4 years ago