☆33Jul 8, 2024Updated last year
Alternatives and similar repositories for mats
Users that are interested in mats are comparing it to the libraries listed below
Sorting:
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"☆24Sep 13, 2024Updated last year
- Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"☆14Mar 28, 2024Updated last year
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Jan 31, 2023Updated 3 years ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆52Dec 22, 2025Updated 2 months ago
- ☆12Jul 30, 2025Updated 7 months ago
- ☆210Feb 3, 2024Updated 2 years ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆32Nov 4, 2024Updated last year
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆77Mar 1, 2025Updated last year
- Pytorch optimizers implementing Hilbert Constrained Gradient Descent☆19May 9, 2019Updated 6 years ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆99Oct 28, 2024Updated last year
- Supervised Training of Conditional Monge Maps☆19Oct 30, 2023Updated 2 years ago
- Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.☆16Oct 28, 2024Updated last year
- ☆15Mar 30, 2020Updated 5 years ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆112Jun 8, 2023Updated 2 years ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆38Feb 27, 2024Updated 2 years ago
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions☆17Apr 4, 2024Updated last year
- [ACL2024 Findings]DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling☆18Jun 6, 2024Updated last year
- LoFiT: Localized Fine-tuning on LLM Representations☆44Jan 15, 2025Updated last year
- Proximal Optimal Transport Modeling of Population Dynamics (AISTATS 2022)☆22Jun 19, 2023Updated 2 years ago
- Code for the paper "Function-Space Learning Rates"☆25Jun 3, 2025Updated 9 months ago
- Exploring Model Kinship for Merging Large Language Models☆27Apr 16, 2025Updated 10 months ago
- ☆19Jan 3, 2025Updated last year
- This is the code for the paper Embrace the Gap: VAEs perform Independent Mechanism Analysis, showing that optimizing the ELBO is equivale…☆23Apr 22, 2024Updated last year
- Representation Surgery for Multi-Task Model Merging. ICML, 2024.☆47Oct 10, 2024Updated last year
- Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic☆32Feb 18, 2026Updated 2 weeks ago
- A set of tests for evaluating large-scale algorithms for Wasserstein-1 transport computation (NeurIPS'22).☆24Sep 9, 2024Updated last year
- [ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf☆25Nov 14, 2023Updated 2 years ago
- This repo implements the CVPR23 paper Trainable Projected Gradient Method for Robust Fine-tuning☆24Nov 27, 2023Updated 2 years ago
- LLMem: GPU Memory Estimation for Fine-Tuning Pre-Trained LLMs☆29May 31, 2025Updated 9 months ago
- PyTorch implementation of the ICML 2020 paper "Latent Bernoulli Autoencoder"☆25Apr 8, 2021Updated 4 years ago
- ☆26Nov 23, 2023Updated 2 years ago
- Bayesian low-rank adaptation for large language models☆28May 4, 2024Updated last year
- Implementation of Action Matching for the Schrödinger equation☆25Jun 18, 2023Updated 2 years ago
- [ICML2022] Variational Wasserstein gradient flow☆24Oct 17, 2022Updated 3 years ago
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations☆12Sep 4, 2024Updated last year
- PyTorch implementation of the paper "Continuous Wasserstein-2 Barycenter Estimation without Minimax Optimization" (ICLR 2021)☆34Jun 17, 2022Updated 3 years ago
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion☆205Feb 6, 2026Updated 3 weeks ago
- Latest Weight Averaging (NeurIPS HITY 2022)☆32Jun 20, 2023Updated 2 years ago
- Model Stock: All we need is just a few fine-tuned models☆129Aug 9, 2025Updated 6 months ago