A framework for merging models solving different tasks with different initializations into one multi-task model without any additional training
☆313Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for ZipIt
Users that are interested in ZipIt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆216Feb 3, 2024Updated 2 years ago
- Code for "Merging Text Transformers from Different Initializations"☆20Feb 2, 2025Updated last year
- A curated list of Model Merging methods.☆95Dec 3, 2025Updated 5 months ago
- Model Merging with SVD to Tie the KnOTS [ICLR 2025]☆91Apr 3, 2025Updated last year
- Code release for REPAIR: REnormalizing Permuted Activations for Interpolation Repair☆53Jan 29, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Git Re-Basin: Merging Models modulo Permutation Symmetries in PyTorch☆78Feb 9, 2023Updated 3 years ago
- Code release for "Git Re-Basin: Merging Models modulo Permutation Symmetries"☆511Mar 7, 2023Updated 3 years ago
- Codebase for Merging Language Models (ICML 2024)☆866May 5, 2024Updated 2 years ago
- Editing Models with Task Arithmetic☆539Jan 11, 2024Updated 2 years ago
- Official repository of the "Transformer Fusion with Optimal Transport" paper, published as a conference paper at ICLR 2024.☆31Apr 19, 2024Updated 2 years ago
- Codes for the paper "Optimizing Mode Connectivity via Neuron Alignment" from NeurIPS 2020.☆16Dec 10, 2020Updated 5 years ago
- ☆19Feb 15, 2023Updated 3 years ago
- Official PyTorch Implementation of "Learning to Learn with Generative Models of Neural Network Checkpoints"☆346Oct 3, 2022Updated 3 years ago
- Codes for Merging Large Language Models☆36Aug 7, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆79Mar 1, 2025Updated last year
- an official PyTorch implementation of the paper "Partial Network Cloning", CVPR 2023☆13Mar 21, 2023Updated 3 years ago
- Model Fusion via Optimal Transport, NeurIPS 2020☆154Nov 16, 2022Updated 3 years ago
- Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"☆31Jun 7, 2024Updated last year
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆15Jun 26, 2025Updated 10 months ago
- [NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging☆143Mar 17, 2025Updated last year
- Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)☆93Jul 25, 2023Updated 2 years ago
- Code for ECCV 2022 paper “Learning with Recoverable Forgetting”☆21Jul 27, 2022Updated 3 years ago
- ☆77Apr 29, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition☆670Jul 22, 2024Updated last year
- Implementation of Hyena Hierarchy in JAX☆10Apr 30, 2023Updated 3 years ago
- Package to align tokens from different tokenizations.☆16Mar 25, 2024Updated 2 years ago
- Tools for merging pretrained large language models.☆7,052Mar 15, 2026Updated last month
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs☆37Sep 19, 2022Updated 3 years ago
- Proportional Amplitude Spectrum Training Augmentation for Synthetic to Real Domain Generalization☆22Mar 11, 2024Updated 2 years ago
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion☆219Apr 27, 2026Updated last week
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆37Oct 11, 2023Updated 2 years ago
- Personal implementation of ASIF by Antonio Norelli☆26May 24, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆112Jun 8, 2023Updated 2 years ago
- A benchmark suite for Scalable Diverse Model Selection for Accessible Transfer Learning from our NeurIPS 2021 paper.☆15Dec 14, 2022Updated 3 years ago
- Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic☆32Feb 18, 2026Updated 2 months ago
- Apply methods described in "Git Re-basin"-paper [1] to arbitrary models --- [1] Ainsworth et al. (https://arxiv.org/abs/2209.04836)☆15Apr 27, 2026Updated last week
- Code repository for the c-BTM paper☆109Sep 26, 2023Updated 2 years ago
- Official repository of Evolutionary Optimization of Model Merging Recipes☆1,419Nov 29, 2024Updated last year
- ☆12Oct 2, 2023Updated 2 years ago