MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation
☆14Sep 2, 2024Updated last year
Alternatives and similar repositories for MAP
Users that are interested in MAP are comparing it to the libraries listed below
Sorting:
- Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.☆32Feb 26, 2025Updated last year
- [ACL 2024] Knowledge Fusion by Evolving Weights of Language Models☆39Sep 19, 2024Updated last year
- [NeurIPS 2025] MergeBench: A Benchmark for Merging Domain-Specialized LLMs☆43Feb 11, 2026Updated last month
- Exploring Model Kinship for Merging Large Language Models☆28Apr 16, 2025Updated 11 months ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆52Dec 22, 2025Updated 2 months ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆76Mar 1, 2025Updated last year
- [ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training"☆18Feb 20, 2025Updated last year
- GPU-accelerated Ant Colony Optimization (ACO)☆17Feb 28, 2025Updated last year
- DSADCSR FOR AIM2019 Extreme Super-Resolution Challenge - Track 1: Fidelity☆13May 27, 2020Updated 5 years ago
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion☆208Updated this week
- A collection of papers related to knowledge fusion☆57Oct 11, 2024Updated last year
- Changes to QEMU to accomodate the teensy3.x arm platform (Cortex-m4)☆16Oct 13, 2019Updated 6 years ago
- Code related to ’Beyond spectral gap: The role of the topology in decentralized learning‘.☆14Jun 7, 2022Updated 3 years ago
- 浙江大学Beamer模板☆15May 19, 2022Updated 3 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆20Feb 27, 2024Updated 2 years ago
- GPU-accelerated Evolutionary Multiobjective Optimization Using Tensorized RVEA.☆20Feb 9, 2026Updated last month
- TheDeepChecker: Dynamic Debugger for Neural Networks Training Programs☆10Nov 2, 2022Updated 3 years ago
- [NeurIPS 2024] For paper Parameter Competition Balancing for Model Merging☆48Oct 11, 2024Updated last year
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2026.☆689Updated this week
- [NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging☆140Mar 17, 2025Updated last year
- ☆17Jan 30, 2023Updated 3 years ago
- [ICML 2019] The Anisotropic Noise in Stochastic Gradient Descent: Its Behavior of Escaping from Sharp Minima and Regularization Effects☆15Apr 12, 2020Updated 5 years ago
- Evolutionary Multi-objective Optimization based Neural Architecture Search for Cognitive Diagnosis☆12Sep 5, 2024Updated last year
- Capstone Research Project in NYU Courant☆10Jan 3, 2020Updated 6 years ago
- A benchmark suite for evaluating Spiking Neural Networks (SNNs) on temporal processing tasks, comparing abilities of SNN-related models a…☆21Aug 14, 2025Updated 7 months ago
- UW DigiPsych Prosody Feature Extraction Repository☆13May 16, 2019Updated 6 years ago
- LaTex Poster for S3-NeRF (NeurIPS 2022)☆19Feb 14, 2023Updated 3 years ago
- ☆69Feb 1, 2025Updated last year
- PDA: Privacy-preserving Distributed Algorithms☆15Feb 5, 2026Updated last month
- The Typst template for SUSTech graduated student thesis☆16Jun 18, 2025Updated 9 months ago
- ☆10Jun 28, 2023Updated 2 years ago
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"☆11Oct 27, 2025Updated 4 months ago
- [ICML 2023] Decentralized SGD and Average-direction SAM are Asymptotically Equivalent☆20Dec 4, 2023Updated 2 years ago
- ☆13Apr 17, 2023Updated 2 years ago
- Editing Models with Task Arithmetic☆534Jan 11, 2024Updated 2 years ago
- Source code for "Taming GANs with Lookahead–Minmax", ICLR 2021.☆15Mar 28, 2021Updated 4 years ago
- Codebase for ICML submission "DOGE: Domain Reweighting with Generalization Estimation"☆21Feb 29, 2024Updated 2 years ago
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Jun 1, 2024Updated last year
- Simple Internet radio built using mpd/mpc and Flask with Buildroot☆16Apr 14, 2024Updated last year